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NUCLEIC ACID VACCINES USING TUMOR ANTIGEN ENCODING NUCLEIC ACIDS 

WITH CYTOKINE AD JUVANT ENCODING NUCLEIC ACID 

FIELD OF THE INVENTION 

The present invention relates to nucleic acid vaccines comprising sequences that encode a tumor 
antigen as an immunogen and a cytokine as an adjuvant. The vaccines are suitable for the 
vaccination of rnanirrials, including humans, in order to provide unexpectedly enhanced cellular 
and/or humoral immune responses to one or more tumor related pathologies. Additionally, the 
invention relates to methods for making and using such nucleic acid vaccines. 

BACKGROUND OF THE INVENTION 

Cancer is a serious disease that afflicts one in four people. In the last fifty years, there have been 
significant improvements in the early detection of cancer, as well as the development of a number 
of therapies to treat cancer. Therapies include surgery to remove primary tumors, and sublethal 
radiation and chemotherapy to treat disseminated disease. While these treatments have resulted in 
apparent cures for many patients, the treatments can be quite debilitating and are still often 
ineffective at preventing death from this disease. There is clearly a need for therapies that are less 
destructive, as well as for novel therapies that harness the body's natural defenses to fight cancer. 

Cancer can be divided into two classifications, depending upon the cell type the tumor is derived 
from. For example, carcinomas are derived from epithelial cells, while sarcomas are derived from 
mesodermal tissues. Some epithelial tumors express on their surface a protein called mucin 1 
(MUC1). 

MUC1 is a transmembrane protein that is normally expressed in non-disease states on ductal 
epithelial cells, such as those in the intestinal mucosa exposed to the lumen of the small intestine. 
The most notable feature of MUC1 is its large extracellular domain, which is comprised of 30-100 
tandem repeats of a 20 amino acid sequence. The tandem repeats confer a rigid structure to this 
portion of the protein, and the repeats are a substrate for heavy glycosylation. In addition, in 
normal cells MUC1 is only expressed on the ductal side of the cell. It is thought that MUC1 may 
provide a lubrication function to the duct, and it may also be involved in signal transduction. 
Because the protein is normally expressed on the ductal side of cells, it is rarely exposed to the 
outside of the organism, and is considered a "sequestered antigen", because in its native form 
MUC1 is not exposed to immune system surveillance. 
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In contrast, MUC1 expression is different in epithelial tumors. The protein becomes 
overexpressed and is present all over the surface of the cell, and it is relatively deglycosylated as 
compared to the normal form expressed in ductal epithelial cells. Thus, the distribution and 
pattern of expression is very different in normal and neoplastic tissues, and the deglycosylated, 
aberrant protein exposes novel epitopes to the immune system. Because the pattern of expression 
is different from normal, it is possible that the immune system can now recognize the tumor- 
associated MUC1 as foreign and attempt to destroy the cells expressing this protein* Indeed, the 
immune system does appear to act in this way in some cancer patients. It has been shown that 
patients with ovarian, breast or pancreatic cancer possess weak antibody and cytotoxic T 
lymphocyte (CTL) responses to MUC1 , indicating that their immune systems do indeed recognize 
a difference in the tumor-associated MUC1. However, the immune responses are clearly not 
strong enough to eliminate tumor cells. 

These observations have led some investigators to develop therapeutic strategies designed to 
induce or strengthen the natural immune response. For example, several groups have attempted to 
use MUC1 peptides to prime a cellular response in patients. This relies on the concept that cells 
could process the peptide and present it in the context of Class I molecules to the immune system, 
to cause a Thl response to cells expressing the MUC1 protein. There are several disadvantages to 
known approaches. First, peptides have short half-lives, requiring administration of large 
amounts of the peptide. Second, each person expresses several Class I molecules and a given 
peptide binds to only one molecule, which will be held by a minority of the patient population. 
Third, the immunity generated by such approaches may not be relevant to treating such cancers; it 
has been noted that anti-peptide immunity can be generated by peptide immunization, which does 
not always lead to anti-protein immunity. 

The identification of tumor-specific antigens has supported the concept that immunologic 

strategies could be designed to specifically target tumor cells in cancer patients. Immunologic 

recognition of tumor antigens has been subsequently documented in patients with malignancy. 

However, these responses are muted and are ineffective in eradicating disease . The development 

of immune tolerance towards malignant cells is due, in part, to the inability of tumor cells to 

effectively present antigens to the immune system. Therefore, T cells with the capability of 

recognizing these antigens fail to become activated A major focus of cancer immunotherapy has 

been the attempt to introduce tumor antigens into the cancer bearing host such that they may be 

recognized more effectively and that meaningful antitumor responses can be generated. In this 

way, native immunity directed against antigens selective for or over-expressed in malignant cells 

may be amplified and result in tumor rejection. Approaches to induce tumor-specific immunity 

have included vaccination with tumor cell extracts, irradiated cells, tumor-specific peptides with 
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and without adjuvant, and dendritic cells (DC) pulsed with tumor peptides/proteins, or 
manipulated to express tumor-specific genes. 

DNA immunization has been used as a method to generate immune responses in vivo, and has 
been recognized as an effective way to generate cytotoxic T cells directed against an encoded 
antigen. Vaccination with tumor-specific naked DNA results in the expression of tumor antigens 
by the inoculated muscle cells. Professional antigen presenting cells, in particular DC, recruited to 
the site of injection, internalize and subsequently present the tumor-specific antigens at sites of T- 
cell traffic. 

Breast cancer is a common malignancy second only to lung cancer among cancer deaths in 
women. In 2000, it was estimated that 182,800 new cases were diagnosed and 41,200 deaths 
resulted from breast cancer in the United States (US). Standard-dose combination chemotherapy 
can yield high response rates in previously untreated patients with metastatic disease, but complete 
responses are rare. Despite initial chemosensitivity, median disease response duration is less than 
1 year due to the emergence of chemoresistant disease. The median survival for patients with 
metastatic disease has remained approximately 2 years for those treated with standard-dose 
chemotherapy. A majority of breast carcinomas express MUC1, As noted in the Investigator's 
Brochure, responses to recombinant vaccine constructs expressing MUC1 have been shown to 
induce immune responses in mice and chimpanzees. As such, immunotherapeutic strategies 
targeting the MUC1 antigen are a potentially promising approach for patients with metastatic 
breast cancer who otherwise lack effective treatment options. 

Prostate cancer is the second leading cause of cancer-related death in men. Approximately 
1 80,000 men will be diagnosed with prostate cancer each year, and 40,000 succumb to the disease 
each year. Prostate tumor cells have a low proliferation rate and do not respond to standard 
chemotherapies, which are most toxic to the most rapidly dividing cells in the body. Instead, 
prostate cancer can be treated surgically, with radiation therapy or hormonal therapy. Surgery and 
radiation therapy can lead to undesirable side effects, such as incontinence and impotence, The 
disease can often be successfully managed with hormonal therapy, which starves the cells for its 
required growth factors. However, eventually all tumors treated in this way become androgen- 
independent and there is no effective treatment beyond that point. There is clearly an unmet 
medical need to treat this disease more effectively, and with novel therapies. 

One such approach that has considerable promise is active immunotherapy. Active 
immunotherapy would stimulate the patient's immune system to generate an anti-tumor response 
that could help hold the disease in check longer, or even rid the patient of metastatic disease. One 
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example of active immunotherapy include dendritic cell therapies, where the patient's professional 
antigen presenting cells are removed and pulsed with tumor antigen, transfected with tumor 
RNA/cDNA, or fused with tumor cells. The ex vivo-treated dendritic cells are then reinjected into 
the patient, and are expected to drive a prostate-tumor specific immune response. One 
disadvantage of such approaches is that they amount to designer therapy that would he very costly 
and require very specialized skills to administer. Such therapies are unlikely in their current form 
to be widely used. 

A second active immunotherapy approach is peptide vaccination. In this approach, tumor-specific 
peptides or proteins are administered to the patient, with the hope of directly loading antigen- 
presenting ceils in vivo. This approach is more likely to be usable in the clinic than the ex vivo 
approach described above, but consistent success has not yet been achieved with this strategy. 
Some problems include that fact that peptides are short-lived in vivo, and therefore require very 
large doses. In some clinical trials, peptide vaccination engenders anti-peptide immune responses 
that do not translate into responses against tumors expressing the whole protein from which the 
peptides were derived. 

A third active immunotherapy approach that has much more promise to be widely used would be a 
cancer vaccine. Specifically, we believe that a DNA vaccination approach could be very effective 
in treating prostate cancer patients. In this treatment, the vaccine would be comprised of plasmids 
(or other DNA-containing agents) that encode antigen(s) specific to prostate cancer. The plasmids 
would be injected into the patient, and the prostate-specific antigens would then be expressed and 
presented to the immune system. The antigen-presentation process would engender a specific 
cellular and/or humoral response that could help to control the growth of the tumor or its 
metastases. From preclinical models there is reason to believe that such an approach could be 
effective. For example, vaccination of rhesus monkeys with DNA vaccines encoding PSA +/- 
cytokine adjuvants drives PSA-specific humoral responses and cellular proliferation. Jh two male 
monkeys vaccinated in this way, there was evidence of infiltrating cells within the prostate post 
vaccination, but not in a non vaccinated control, m work in our labs, we have shown that 
vaccination with DNA encoding a different tumor associated antigen, MUC1, can lead to immune 
responses protective against tumor challenge with MUCl-expressing tumors. Thus, it may be 
possible to use DNA vaccines to break tolerance to self-antigens that happen to be strongly 
expressed by tumors, and mount a therapeutic immune response. 

While vaccination with PSA with or without cytokine adjuvants may very well be effective as an 
immunotherapy, it is possible that this would not be enough.to control tumor growth. It is entirely 
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possible that an effective immune response against PSA would eliminate PSA+ tumor cells but 
leave PSA- prostate tumor cells intact and able to grow unfettered. Therefore, it may be desirable 
to vaccinate with more than one tumor antigen. We propose that a DNA vaccine comprised of the 
PSA antigen with other antigens expressed highly in prostate cancer, such as KLK2 and/or MUC1, 
and perhaps with other adjuvant/costimulatory genes, would be a more effective approach than 
vaccination with a single antigen. 

PSA or KLK3 is a member of a multigene family known as the human kallikrein gene family. 
There are 15 closely related genes in the family, all of which map to a 300kb region of human 
chromosome 19ql3.3-ql3 A Kallikreins are secreted serine proteases. All are synthesized as 
preproenzymes; proenzymes arise after removal of the signal peptide, and the mature active 
protease arises after removal of a propeptide. The activity of a given kallikrein will be either 
trypsin-like or chymotrypsin-like, depending upon the nature of the active site. PSA or KLK3 is a 
30 Kd serine protease with chymotrypsin-like activity, which is responsible for cleaving 
seminogelin I, seminogelin II and fibronectin in seminal fluid. PSA is most highly expressed in 
the prostate, but it is also expressed at lower levels in breast, salivary gland, and thyroid. Besides 
prostate cancer, PSA is expressed in some breast malignancies. PSA has become well known as a 
serum marker for prostate cancer; it is a very important diagnostic for this disease and increasing 
serum levels of PSA typically correlate well with the severity of the disease. Expression of PSA is 
not increased in prostate cancer cells versus normal prostate cells; instead as the disease breaches 
the normal cellular barriers, PSA leaks into the serum. It is unclear if PSA has a role in the 
etiology of prostate cancer; various reports have indicated that PSA could either enhance or inhibit 
tumorigenicity. Several CTL epitopes for PSA have been described for the HLA A2 and A3 
haplotypes; identification of these epitopes support the possibility of generating therapeutic in 
vivo CTL by vaccination. 

KLK2 is the member of the kallikrein family that most closely resembles PSA, with about 80% 
identity at the amino acid level. Like PSA, KLK2 is expressed highly in the prostate and in 
prostate cancer, with lower levels of expression in other tissues, such as breast, thyroid, and 
salivary gland. KLK2 has trypsin-like activity, and one of its activities is to cleave the proenzyme 
form of PSA to yield the mature enzyme. There is increasing recognition that KLK2 may be a 
good serum prognostic indicator to monitor the progress of prostate cancer patients, although it is 
likely to be a supportive diagnostic along with PSA. 



5 



WO 03/031569 



PCTAJS02/2964O 



Accordingly, there is a long-felt and pressing need to discover vaccines and methods that elicit 
an immune response that is sufficient to treat or prevent various tumor related human 
pathologies. 

SUMMARY OF THE INVENTION 

The present invention is intended to overcome one or more deficiencies of the related arts. In 
particular, nucleic acid vaccines of the present invention advantageously provide a more robust 
immune response. The strength of the present invention lies in its power to recruit one or more 
of B cell, helper T cell, and cytotoxic T cell components of the immune response for effective 
humoral and cellular immunity. 

To provide more effective tumor or cancer vaccines, the present invention provides nucleic acid 
vaccines comprising a cancer-specific or tumor-specific antigen nucleic acid and an adjuvant 
nucleic acid. Also provided are methods of making and using such nucleic acid vaccines. In 
their use as a vaccine, the co-expression of tumor nucleic acid and the adjuvant nucleic acid in 
a tissue to which the vaccine of the present invention has been introduced induces a cellular or 
humoral immune response, or any component thereof, to the tumor protein or fragment thereof 

This invention uses nucleic acids (or fragments thereof) encoding such tumor antigens as, but not 
limited to, prostrate specific antigen (PSA), KLK2, and/or mucin- 1 (MUC1) as antigen 
components of a DNA vaccine for tumors, such as but not limited to, any PSA, KLK2 or MUC-1 
associated tumor or cancer. The antigen genes will be of human origin, or mutated to enhance their 
immunogenicity. Examples of how the antigen genes could be rendered more immunogenic 
would include alteration or removal of signal sequences required for secretion, optimization of 
codons for improved translation, addition of ubiquitination signals for degradation, addition of 
subcellular compartment targeting sequences, addition of molecular chaperone sequences, and 
optimization of CTL epitopes. The antigen genes could be fused together to increase 
immunogenicity. The CTL/helper epitopes could be linked together, or inserted as part of another 
molecule, such as an immunoglobulin molecule. 

Other genes may also be included in the vaccine, including cytokine adjuvant genes such as EL- 18, 
IL-12 or GM-CSF, or genes for costimulatory molecules such as B7-1, which would help to drive 
the immune response. 

The genes of the invention could be encoded by plasmids, viruses, bacteria or mammalian cells. 
The vaccination regimen could be comprised of any or all of these agents, such as a-plasmid DNA 
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priming vaccination, followed by a viral vector boost. The latter approach appears to be effective 
in generating cellular responses important in controlling infectious diseases (28-32), and may be 
very useful in anti-cancer applications of this technology as well. 

In the vaccines of the invention, the tumor encoding nucleic acid may be isolated from patients 
having a tumor related cancer, preferably from the cancerous tissue itself or from mRNA or 
cDNA encoding a cancer-related tumor protein or antigenic portion thereof. 

The present inventors have discovered that nucleic acid vaccines of the present invention elicit 
unexpectedly enhanced immune responses by the expression and/or presentation of at least one 
tumor antigen encoding nucleic acid and at least one cytokine adjuvant encoding nucleic acid. 

The present invention also provides at least one tumor/adjuvant nucleic acid encoding (or 
complementary to) at least one antigenic determinant encoding nucleic acid of at least one 
tumor protein and at least one adjuvant encoding nucleic acid of at least one portion of an IL-1 8 
protein. 

The present invention also provides a tumor/adjuvant vaccine composition comprising a 
tumor/adjuvant nucleic acid vaccine of the present invention, and a pharmaceutically 
acceptable carrier or diluent. The vaccine composition can further comprise an additional 
adjuvant and/or cytokine encoding sequence or further component of the composition which 
enhances a nucleic acid vaccine immune response to at least one cancer associated tumor 
protein in a mammal administered the vaccine composition. A nucleic acid vaccine of the 
present invention is capable of inducing an immune response inclusive of at least one of a 
humoral immune response (e.g., antibodies) and a cellular immune response (e,g,, activation of 
B cells, helper T cells, and cytotoxic T cells (CTLs)), with a cellular immune response 
preferred. 

The present invention also provides a method for eliciting an immune response to a cancer 
associated tumor protein in a mammal which is prophylactic for a cancer associated tumor 
protein, the method comprising administering to a mammal a vaccine composition comprising a 
nucleic acid vaccine of the present invention, which is protective for the mammal against a 
clinical MCU-1 -related pathology. 

The present invention also provides a method for eliciting an immune response to a cancer 
associated tumor protein in a mammal for therapy of a tumor-associated pathology, such as but 
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not limited to a tumor or cancer. The method comprises administering to a mammal a 
composition comprising a nucleic acid vaccine of the present invention, which composition 
elicits an enhanced immune response, relative to controls, in the mammal against a clinical 
tumor related pathology. 

In a further embodiment, the prophylactic or therapeutic method of eliciting an immune 
response to tumor comprising administering an effective amount of another (e.g., second) 
nucleic acid vaccine comprising at least 1 to about 100 different tumor protien fragments or 
variants, in which the fragments or variants relate to different tumor nucleic acid or amino 
sequences, preferably related to a cancer-associated or pathology-associated tumor protien or 
antigen sequence. 

The tumor-specific immune response generated with at least one nucleic acid vaccine of the 
invention can be further augmented by priming or boosting a humoral or cellular immune 
response, or both, by administering an effective amount of at least one tumor/adjuvant vaccine. 
Any of the vaccine strategies provided herein or known in the art can be provided in any order. 
For example, a subject may be primed with a nucleic acid vaccine, followed by boosting with a 
nucleic acid vaccine or a protein vaccine. Preferably, the tumor/adjuvant vaccine is 
administered intramuscularly. Preferably, the vaccine is in the form of a plasmid and is 
administered with a gene gun or injector pen, needled or needleless. However, other forms and 
administration are also suitable and included in the present invention. 

The present invention also provides methods, compositions, articles of manufacture and the like, 
for making and using a tumor/adjuvant nucleic acid vaccine of the present invention. 

Other objects, features, advantages, utilities and embodiments of the present invention will be 
apparent to skilled practitioners from the following detailed description and examples relating to 
the present invention, in combination with what is known in the art. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1. Female C57B1/6 mice were vaccinated three times (Day —28, -14, and —7) with buffer, 
empty vector, pMUCl plasmid, pIL-18 plasmid, or combinations of the latter two plasmids. 
Animals were challenged with MUC1+ mouse tumor cells on Day 0, and were monitored for 
tumor incidence for 50 days. 
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Figure 2. Female C57B1/6 mice were vaccinated three times (Day -28, -14 and ~7) with buffer, 
empty vector, pMUCl plasmid, pEL-18 plasmid, or combinations of the latter two plasmids. 
Animals were challenged with MUC1+ mouse tumor cells on Day 0, and were monitored for 
tumor growth for up to 50 days. 

Figure 3, C57BI/6 mice free of tumors in Figure 1 were rechallenged with MUC1 + tumor cells 
on Day 49 {denoted Day 0 in this figure). Mice were monitored an additional 49 days after the 
second tumor challenge. 

Figure 4. MUC1 Tg mice were vaccinated three times (Day -28, -14, and -7) with the 
plasmids indicated in the legend. Mice were challenged with MUC1+ tumor cells on Day 0 and 
monitored for tumor incidence for 28 days. 

Figure 5. Animals from Figure 4 were sacrificed, and their tumors were excised and weighed 
on Day 28 after tumor challenge. Horizontal bars are median values. 

Figure 6. Phase H of the pMUCl/pIL-18 vaccination of MUC1 Tg mice. MUC1 Tg mice 
without tumors at the end of Phase I {Figure 4) were rechallenged with a second dose of MUC1+ 
tumor cells on Day 50 after the first challenge (denoted Day 0 in this figure). Mice were 
monitored for tumor incidence for 28 days after the second challenge. 

Figure 7- Remaining tumor-free MUC1 Tg mice from Phase II {Figure 6) were challenged on 
Day 28 of Phase II with MUC1' parental tumor cells {denoted as Day 0 in this figure). Animals 
were monitored for tumor incidence 39 days post challenge. 

Figure 8A-C- A, DNA sequence of human IL-18plasmid pi 968 with the protein sequence of 
Figure 8B included, B, C. Protein sequence of the precursor human IL-18 produced by the 
engineered IL-18 constructs. The first 19 residues are derived from the 12B75 HC signal 
sequence; the remaining 161 residues are the mature human IL-18. In the version shown in C 5 the 
first residue of the mature human EL-18 sequence is altered to better conform to consensus human 
immunoglobulin signal sequences. 

Figure 9A-D: Sequence of human MUC1 cDNA with intron 6 incorporated. 

Figure 10. Tumor incidence in female MUC1 transgenic mice vaccinated with DNA as 
indicated in the legend, and subsequently challenged with MUC1 + tumor cells. Only the group 
vaccinated with pMUCl/pIL-1 8 shows significantly improved protection from tumor challenge 
(p=0.007). 

Figure 11. Media tumor weights at study end, from animals shown in Figure I. Media tumor 
weight for group 4 is significantly different from those in the other groups. 
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Figure 12. Rechallenge of protected mice from Figure 1 with MUC1" tumor cells. 

Figure 13. Tumor incidence in male mice vaccinated with pMUCl or empty vector, followed 

by tumor challenge. 

Figure 14. Tumor weights in male mice vaccinated with pMUCL 

Figure 15, Tumor incidence in male mice rechallenged on the opposite flank with MUC1+ tumor 
cells. 

DETAILED DESCRIPTION OF THE DISCLOSURE 

The present inventors have discovered that unexpectedly enhanced immune responses can be 
induced against tumor associated pathologies, by the use of nucleic acid vaccines that contain a 
combination of at least one tumor antigen or protein encoding nucleic acid and at least one 
cytokine encoding nucleic acid. 

The terms "priming" or "primary" and "boost" or "boosting" are used herein to refer to the 
initial and subsequent immunizations, respectively, i.e., in accordance with the definitions these 
terms normally have in immunology. 

The component encoding nucleic acids of a tumor/adjuvant encoding nucleic acid of the present 
invention can be provided using any known method or source. Alternatively, the different 
tumor nucleic acids can be obtained from any source and selected based on screening of the 
sequences for differences in coding sequence or by evaluating differences in elicited humoral 
and/or cellular immune responses to multiple tumor sequences, in vitro or in vivo, according to 
known methods. 

As is readily appreciated by one of skill in the art, the inventors have further found that 
boosting with a tumor/adjuvant vaccine of the present invention further potentiates the 
immunization methods of the invention. The tumor protein(s) encoded by the nucleic acid 
vaccine can be similar or different different to the tumor protein(s) in the boosters. 

Similarly, as can be appreciated by the skilled artisan, the immunization methods of the present 
invention are enhanced by use of primer, booster or additional administrations of a DNA 
vaccine of the present invention. The tumor/adjuvant vaccine can be used as a boost, e,g., as 
described above with respect to the tumor proteins. Alternatively, the vaccine can be used to 
prime immunity, with the vaccine or vaccines used to boost the anti-tumor immune response. 
The vaccine may comprise one or more vectors for expression of one or more tumor proteins 
or portions thereof. In a preferred embodiment, vectors are prepared for expression as part of a 
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DNA vaccine. 

The invention is a therapeutic vaccine that would be used in patients with cancer, where PSA 
and/or KLK2 and/or MUCi are uniquely expressed, or overexpressed relative to normal tissue. 
The vaccine could potentially be preventative therapy for individuals at high risk of developing 
prostate or other cancers or tumors expressing these antigens. The vaccine could also be used in 
other cancers where PSA and/or KLK2 and/or MUCI are either uniquely expressed or 
overexpressed relative to normal tissue. The vaccine would be comprised of DNA encoding any 
combination of these antigens, and could be contained within one or more plasmids, mammalian 
viruses, bacteria or mammalian cells. The antigen or adjuvant encoding nucleic acids as one or 
more components of the vaccine could include any alternatively spliced forms that naturally occur. 
The antigen genes may contain modified sequences that will include optimized codons for 
translation in human cells, or signals for ubiquitination that would lead to enhanced degradation. 
The vaccine could contain fragments of the antigen genes, including antigen-specific CTL 
epitopes linked to each other, or to other heterologous CTL epitopes and/or 
homologous/heterologous CD4 helper epitopes. Fragments of the antigen genes could be 
generated that lack signal sequences, which could enhance degradation and antigen presentation. 
Fragments of the antigen genes could be encoded as fusions with other proteins, ot inserted within 
other protein sequences, such as immunoglobulin sequences. Natural variant sequences have been 
reported for PSA, KLK2 and MUCI, and are useful in the present invention, e.g., but not limited 
to those presented in SEQ ED NOS: 1-47, and specified variants thereof. 

The vaccination regimen could include a mixture of DNA-encoding agents, temporally 
administered in different orders, or administered in different places in the body at the same time. 
Plasmids could be formulated in lipid, buffer or other excipients or chemical adjuvants that could 
aid delivery of DNA, maintain its integrity in vivo, or enhance the immunogenicity of the vaccine. 
The vaccine could also be delivered by direct injection into muscle, skin, lymph node, or by 
application to mucosal surfaces. Other potential modes of delivery would include injection of 
DNA, followed by electroporation to enhance cellular uptake and expression of DNA. 

One possible cytokine adjuvant that could be included in the vaccine is human EL- 18. Variants of 
human IL-1 8 sequence have been reported, , e.g,, but not limited to those presented in SEQ ID 
NOS:60-77, and specified variants thereof The macaque sequence for IL-1 8 is very similar to 
human EL-18, and can also be used according to the present invention. 
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The antigen genes, or costimulatoiy molecule genes, or cytokine adjuvant genes would be 
expressible in humans because of being linked to a promoter. The genes would also be 
expressible because of linkage to a polyadenylation signal, such as the SV40 late polyadenylation 
signal. An intron may be included for enhanced expression, such as the HCMV IE intronA, or 
natural introns from the antigen or adjuvant genes. 

Advantages: 

Active immunotherapy offers the possibility that cancer patients could develop long-lasting and 
vigorous immune responses against their tumors that would prolong life, slow disease progression, 
and possibly eradicate disease. When used as an adjunct therapy, active immunotherapy may 
increase quality of life by minimizing the toxicity of other conventional therapies. DNA 
vaccination in particular offers a simple approach toward generating protective immune responses. 

We have demonstrated in our MUC1 vaccination model that DNA vaccination can lead to epitope 
spreading. There are no other reports of anti-tumor efficacy engendered by coadministration of 
plasmid DNA encoding MUC1 and any other costimulatory/adjuvant molecule, particularly EL~1 8. 
hi addition, this is the only instance found so far of epitope spreading as a result of plasmid DNA 
vaccination in tumor models. As mentioned above, if this phenomenon could be induced in 
humans, it would induce immunity to MUC1 as well as to other unknown tumor-associated 
antigens that are present in the tumor. This multi -antigen attack on the tumor would minimize or 
inhibit the ability of the tumor to evade the immune response. This approach also is applicable to 
a vaccine using PSA as the antigen, or PSA in combination with other antigens and adjuvant 
molecules. 

Another advantage of our approach is the ability to encode more than one gene on a plasmid or 
DNA vehicle to enable delivery of more than one protein product to a target tissue/cell (33, 34). 
This should ensure that a target tissue expresses all desired proteins with the expectation of a more 
efficient induction of immune response. For example, we have constructed a double cistron 
vector, and for example we have shown that it is capable of expressing mouse or human IL-12. IL- 
12 is a protein comprised of two subunits that must be co-expressed in the same cell in order for 
the mature molecule to be produced. The two protein subunits are encoded by different genes, and 
we have shown in tissue culture that a double cistron vector encoding both genes results in more 
effective production of the mature protein than using two plasmids which encode either gene alone 
(33, 34). 
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Nucleic acid vaccines and Vaccination 

The present invention thus provides, in one aspect, nucleic acid vaccines using mixtures of at 
least I, and up to 50 different tumor and cytokine encoding nucleic acids that optionally each 
can express a different protein variant, or an antigenic portion thereof- As can be readily 
appreciated to one of skill in the art, 1 to about 50 different tumor protein encoding nucleic 
acids can be employed- Also provided are methods of making and using such nucleic acid 
vaccines. 

A nucleic acid vaccine of the present invention induces at least one of a humoral and a cellular 
immune response in a mammal who has been administered at least one nucleic acid vaccine, 
but the response to the vaccine is subclinical, or is effective in enhancing at least one immune 
response to at least one tumor antigen, such that the vaccine administration is suitable for 
vaccination purposes. 

DNA vaccines. An alternative to a traditional vaccine comprising an antigen and an adjuvant 
involves the direct in vivo introduction of DNA encoding the antigen into tissues of a subject 
for expression of the antigen by the cells of the subject's tissue. Such vaccines are termed 
herein "DNA vaccines" or "nucleic acid-based vaccines." DNA vaccines are described in 
International Patent Publication WO 95/20660 and International Patent Publication WO 
93/191 83 s the disclosures of which are hereby incorporated by reference in their entireties. The 
ability of directly injected DNA that encodes a viral protein to elicit a protective immune 
response has been demonstrated in numerous experimental systems (Conry et al., Cancer Res., 
54:1164-1168 (1994); Cox et al., Virol, 67:5664-5667 (1993); Davis et al., Hum. Mole. 
Genet., 2:1847-1851 (1993); Sedegah et al., Proc. NatL Acad. Sci. s 91:9866-9870 (1994); 
Montgomery et ah, DNA Cell Bio., 12:777-783 (1993); Ulmeret al., Science, 259:1745-1749 
(1993); Wang etal., Proc. Natl, Acad. Sci., 90:4156^4160 (1993); Xiang et ah, Virology, 
1 99: 132-140 (1994)). Studies to assess this strategy in neutralization of influenza virus have 
used both envelope and internal viral proteins to induce the production of antibodies, but in 
particular have focused on the viral hemagglutinin protein (HA) (Fynan et al., DNA Cell, 
Biol., 12:785-789 (1993A); Fynan et ah, Proc. NatL Acad. Sci., 90:11478-11482 (1993B); 
Robinson etal., Vaccine, 11:957, (1993); Webster et al., Vaccine, 12:1495-1498 (1994)). 

As is well known in the art, a large number of factors can influence the efficiency of expression 
of antigen genes and/or the immunogenicity of DNA vaccines. Examples of such factors 
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include the reproducibility of inoculation, construction of the plasmid vector, choice of the 
promoter used to drive antigen gene expression and stability of the inserted gene in the plasmid. 
Depending on their origin, promoters differ in tissue specificity and efficiency in initiating 
mRNA synthesis (Xiang et aL, Virology, 209:564-579 (1994); Chapman et aL, Nucle. Acids. 
Res., 19:3979-3986 (1991)). To date, most DNA vaccines in mammalian systems have relied 
upon viral promoters derived from cytomegalovirus (CMV). These have had good efficiency 
in both muscle and skin inoculation in a number of mammalian species. Another factor known 
to affect the immune response elicited by DNA immunization is the method of DNA delivery; 
parenteral routes can yield low rates of gene transfer and produce considerable variability of 
gene expression (Montgomery, 1993, supra). High-velocity inoculation of plasmids, using a 
gene-gun, enhanced the immune responses of mice (Fynan, 1993B, supra; Eisenbraun et aL, 
DNA Cell Biol., 12: 791-797 (1993)), presumably because of a greater efficiency of DNA 
transfection and more effective antigen presentation by dendritic cells. Vectors containing the 
nucleic acid-based vaccine of the invention may also be introduced into the desired host by 
other methods known in the art, e.g., transfection, electroporation, microinjection, transduction, 
cell fusion, DEAE dextran, calcium phosphate precipitation, lipofection (lysosome fusion), or a 
DNA vector transporter (see, e.g., Wu et aL, J. Biol. Chem. 267:963-967 (1992); Wu and Wu, 
J. Biol. Chem, 263:14621-14624(1988); Hartmut et al,, Canadian Patent Application No. 
2,012,3 1 1, filed Mar. 1 5, 1990), or any other known method or device. 

Viral Vector Vaccines. As can be readily appreciated by one of ordinary skill in the art, nucleic 
acid vaccines of the present invention can also be incorporated into any recombinant virus and 
can be used to introduce a vaccine of the invention. Examples of suitable viruses that can act as 
recombinant viral hosts for vaccines, in addition to vaccinia, includes canarypox, adenovirus, 
and adeno-associated virus, as known in the art. Various genetically engineered virus hosts 
("recombinant viruses") can be used to prepare viral vaccines for administration of nucleic acid 
encoding tumor antigens, Viral vaccines can promote a suitable immune response that targets 
activation of B lymphocytes, helper T lymphocytes, and cytotoxic T lymphocytes. Numerous 
virus species can be used as the recombinant virus hosts for the vaccines of the invention. A 
preferred recombinant virus for a viral vaccine is vaccinia virus (International Patent 
Publication WO 87/06262, Oct. 22, 1987, by Moss et al.; Cooney et aL, Proc. NatL Acad. 
Sci. USA 90:1882-6 (1993); Graham et al., J. Infect. Dis. 166:244-52 (1992); McElrath et al., 
J. Infect. Dis. 169:41-7(1994)). In another embodiment, recombinant canarypox can be used 
(Pialoux et aL, AIDS Res. Hum. Retroviruses 1 1 :373-8 1 (1995), erratum in AIDS Res. Hum. 
Retroviruses 11:875 (1995); AnderssonetaL, J. Infect. Dis. 174:977-85 (1996); Fries et aL, 
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Vaccine 14:428-34 (1996); Gonczol et al., Vaccine 13:1080-5 (1995)). Another alternative is 
defective adenovirus or adenovirus (Gilardi-Hebenstreit et al„ J- Gen. Virol. 71:2425-31 
(1990); Prevec et al., J. Infect Dis. 161:27-30 (1990); Lubecketal., Proc. Natl. Acad. Sci. 
USA 86:6763-7 (1989); Xiang et al., Virology 219:220-7 (1996)). Other suitable viral vectors 
include retroviruses that are packaged in cells with amphotropic host range (see Miller, Human 
Gene Then 1:5-14 (1990); Ausubel et al., Current Protocols in Molecular Biology, sec. 9), and 
attenuated or defective DNA virus, such as but not limited to herpes simplex virus (HSV) (see, 
e.g., Kaplitt et al., Molec. Cell. Neurosci. 2:320-330 (1991)), papillomavirus, Epstein Barr 
virus (EBV), adeno-associated virus (AAV) (see, e.g., Samulski et al., J. ViroL 61:3096-3101 
(1987); Samulski et al., J. Virol. 63:3822-3828 (1989)), US Patent Nos: 5990091, 5766599, 
5756103, 6086890, 6274147, 05585254, 6140114, 5616326, 6099847, 6221136, 6086891, 
5958425, 5744143, 5558860, 5266489, 5858368, 5795872, 5693530, 6020172, and the like, 
each entirely incorporated herein by reference, 

Bi-functional plasmids for virus and DNA vaccines. Another aspect of the present invention 
concerns engineering of bi-fiinctional plasmids that can serve as a DNA vaccine and a 
recombinant virus vector. Direct injection of the purified plasmid DNA, i.e., as a DNA 
vaccine, would elicit an immune response to the antigen expressed by the plasmid in test 
subjects. The plasmid would also be useful in live, recombinant viruses as immunization 
vehicles. 

The bi-functional plasmid of the invention provides a heterologous gene, or an insertion site for 
a heterologous gene, under control of two different expression control sequences: an animal 
expression control sequence, and a viral expression control sequence. The term "under control" 
is used in its ordinary sense, i.e,, operably or operatively associated with, in the sense that the 
expression control sequence, such as a promoter, provides for expression of a heterologous 
gene. In another embodiment, the animal expression control sequence is a mammalian 
promoter (avian promoters are also contemplated by the present invention); in a specific 
embodiment, the promoter is a late or early SV40 promoter, cytomegalovirus immediate early 
(CMV) promoter, a vaccinia virus early promoter, or a vaccinia virus late promoter, or any 
combination thereof. Subjects could be vaccinated with a multi -tiered regimen, with the bi- 
fiinctional plasmid administered as DNA and, at a different time, but in any order, as a 
recombinant virus vaccine. The invention contemplates single or multiple administrations of 
the bi-functional plasmid as a DNA vaccine or as a recombinant virus vaccine, or both. This 
vaccination regimen may be complemented with administration of viral vaccines (infra), or 
may be used with additional vaccine vehicles. 
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As one of ordinary skill in the art can readily appreciate, the bi-functional plasmids of the 
invention can be used as nucleic acid vaccine vectors. Thus, by inserting at least 1 to about 50 
different tumor genes into bi-functional plasmids, thus preparing a corresponding set of bi- 
functional plasmids useful as a nucleic acid vaccine can be prepared. 

Active immunity elicited by vaccination with a tumor protein or proteins according to the 
present invention can prime or boost a cellular or humoral immune response. The tumor 
protein or proteins, or antigenic fragments thereof, can be prepared in an admixture with an 
adjuvant to prepare a vaccine. 

The term "adjuvant" refers to a compound or mixture that enhances the immune response to an 
antigen. An adjuvant can serve as a tissue depot that slowly releases the antigen and also as a 
lymphoid system activator that non-specifically enhances the immune response (Hood et ah, 
Immunology, Second Ed., 1984, Benjamin/Cummings: Menlo Park, Calif, p. 384). Often, a 
primary challenge with an antigen alone, in the absence of an adjuvant, will fail to elicit a 
humoral or cellular immune response. Adjuvants include, but are not limited to, complete 
Freund's adjuvant* incomplete Freund's adjuvant, saponin, mineral gels such as aluminum 
hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, 
peptides, oil or hydrocarbon emulsions, keyhole limpet hemocyanins, dinitrophenol, and useful 
human adjuvants such as BCG (bacille Calmette-Guerin) and Corynebacterium parvum. 

■ 

Selection of an adjuvant depends on the subject to be vaccinated. Preferably, a 
pharmaceutically acceptable adjuvant is used. For example, a vaccine for a human should 
avoid oil or hydrocarbon emulsion adjuvants, including complete and incomplete Freund's 
adjuvant. One example of an adjuvant suitable for use with humans is alum (alumina gel). In a 
specific embodiment, recombinant tumor protein is administered intramuscularly in alum. 
Alternatively, the recombinant tumor protein vaccine can be administered subcutaneously, 
intradermally, intraperitoneally, or via other acceptable vaccine administration routes. 

Vaccine administration. According to the invention, immunization against tumors can be 
accomplished with a nucleic acid tumor/adjuvant vaccine of the invention alone, or in 
combination with a viral encoding tumor vaccine or a tumor protein vaccine, or both. In a 
specific embodiment, tumor nucleic acid or viral vaccine is provided intramuscularly (i.m.) to 
boost the immune response. 

Each dose of vaccine may contain the same 1 to 50 nucleic acid sequences encoding the same 
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or different tumor proteins or portions thereof. Alteratively, the tumor sequences in subsequent 
vaccines may express different tumor genes or portions thereof, hi yet another embodiment, 
the subsequent vaccines may have some tumor sequences in common, and others that are 
different, from the earlier vaccine. For example, the priming vaccine may contain nucleic acids 
expressing tumor proteins arbitrarily designated 1-2. A second (booster) vaccine may contain 
vaccines expressing tumor proteins 3-5 or 6-10, etc. 

Tumor Vaccine Variants 

As noted above, a tumor/adjuvant encoding nucleic acid for use in the vaccines of the invention 
can be obtained from different cancer or normal tumor patients or different geographically local 
isolates, or from geographically diverse isolates. 

A tumor/adjuvant vaccine also includes nucleic acid encoding polypeptides having 
immunogenic activity elicited by an amino acid sequence of a tumor amino acid sequence as at 
least one epitope or antigenic determinant. Such amino acid sequences substantially 
correspond to at least one 10-200 amino acid fragment and/or consensus sequence of a known 
tumor antigen protein sequence, as described herein or as known in the art. Such a tumor 
antigen sequence can have overall homology or identity of at least 50% to a known tumor 
protein amino acid sequence, such as 50-99% homology, or any range or value therein, while 
eliciting an immunogenic response against at least one type of tumor protein, preferably 
including at least one pathologic form. 

Percent homology can be determined, for example, by comparing sequence information using 
the GAP computer program, version 6.0, available from the University of Wisconsin Genetics 
Computer Group (UWGCG), The GAP program utilizes the alignment method of Needleman 
and Wunsch (J. Mol, Biol 48:443 (1970)), as revised by Smith and Waterman (Adv. Appl. 
Math, 2:482(1981)). Briefly, me GAP program defines similarity as the number of aligned 
symbols (i.e., nucleotides or amino acids) which are similar, divided by the total number of 
symbols in the shorter of the two sequences. The preferred default parameters for the GAP 
program include: (1) a unitary comparison matrix (containing a value of 1 for identities and 0 
for non-identities) and the weighted comparison matrix of Gribskov and Burgess, Nucl. Acids 
Res. 14:6745 (1986), as described by Schwartz and Dayhoff, eds f , Atlas of Protein Sequence 
and Structure, National Biomedical Research Foundation, Washington, D,C. (1979), pp. 353- 
358; (2) a penalty of 3.0 for each gap and an additional 0.10 penalty for each symbol in each 
gap; and (3) no penalty for end gaps, 
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In another embodiment, a tumor/adjuvant vaccine of the present invention comprises a 
pathologic form of at least one tumor protein. Examples of such sequences are readily 
available from commercial and institutional tumor sequence databases, such as GENBANK, or 
other publically available databases. Substitutions or insertions of a tumor or cytokine to obtain 
an additional tumor or cytokine protein, encoded by a nucleic acid for use in a viral or nucleic 
acid vaccine of the present invention, can include substitutions or insertions of at least one 
amino acid residue (e.g., 1-25 amino acids). Alternatively, at least one amino acid (e.g.-> 1-25 
amino acids) can be deleted from a turnor or cytokine sequence. Preferably, such substitutions, 
insertions or deletions are identified based on sequence determination of proteins obtained by 
nucleotide sequencing of at least one tumor or cytokine encoding nucleic acid from an 
individual. 

■ 

Non-limiting examples of such substitutions, insertions or deletions preferably are made by the 
amplification of DNA or RNA sequences from tumor, which can be determined by routine 
experimentation to provide modified structural and functional properties of an protein or a 
tumor or cytokine. The tumor or cytokine protein seuquences so obtained preferably have 
different antigenic or adjuvant properties from the original tumor or cytokine. Such antigenic 
differences can be determined by suitable assays, e.g., by testing with a panel of monoclonal 
antibodies specific for tumor or cytokine proteins in an ELISA assay, 

Any substitution, insertion or deletion can be used as long as the resulting tumor and cytokine 
proteins or antigenic determinants thereof elicits antibodies which bind to tumor proteins, but 
which tumor proteins have a different pattern than antibodies elicited by a second tumor 
protein. Each of the above substitutions, insertions or deletions can also include modified or 
unusual amino acids, e.g., as provided in 37 C.F.R* section L822(p)(2), which is entirely 
incorporated herein by reference. 

The following present non-limiting examples of alternative nucleic acid sequences (recited as 
DNA sequences, but also including the corresponding RNA sequence (where U is substituted 
for T in the corresponding RNA sequence)) of tumor antigen proteins of tumors, as well as 
cytokine adjuvant nucleic acid sequences, that can be encoded by a nucleic acid according to 
present invention. Such nucleic acid vaccines can comprise at least one tumor antigen protein 
encoding nucleic acid and at least one cytokine adjuvant protein encoding nucleic acid, and can 
include linear or circular DNA or RNA, optionally further comprising additional regulatory 
sequences, such as but not limited to promoters, enhancers, selection, restriction sites, and the 
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like, as well known in the art. For amino acid sequences any suitable codon can be used for 
expression, preferably human preferred codons as well known in the art (see, e.g., Ausubel, 
supra, Appendices) and such sequences can be further modified, e.g,, where specific antigenic 
sequences can be used. 



SEQUENCE LISTING 

PSA/KLK3 sequences 
1. PSA <SEQ ID NO:l) 

lie Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro Trp Gin Val 
15 10 15 

Leu Val Ala Ser Arg Gly Arg Ala Val Cys Gly Gly Val Leu Val His 

20 25 30 

Pro Gin Trp Val Leu Thr Ala Ala His Cys lie Arg Asn Lys Ser Val 
35 40 45 

lie Leu Leu Gly Arg His Ser Leu Phe His Pro Glu Asp Thr Gly Gin 
50 55 60 

Val Phe Gin Val Ser His Ser Phe Pro His Pro Leu Tyr Asp Met Ser 
65 70 75 80 

Leu Leu Lys Asn Arg Phe Leu Arg Pro Gly Asp Asp Ser Ser His Asp 

85 90 95 

Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Glu Leu Thr Asp Ala Val 

100 105 110 

Lys Val Met Asp Leu Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr Cys 
115 120 125 

Tyr Ala Ser Gly Trp Gly Ser lie Glu Pro Glu Glu Phe Leu Thr Pro 
130 135 140 

Lys Lys Leu Gin Cys Val Asp Leu His Val lie Ser Asn Asp Val Cys 
145 150 155 160 

Ala Gin Val His Pro Gin Lys Val Thr Lys Phe Met Leu Cys Ala Gly 

165 170 175 

Arg Trp Thr Gly Gly Lys Ser Thr Cys Ser Gly Asp Ser Gly Gly Pro 

180 185 190 

Leu Val Cys Asn Gly Val Leu Gin Gly lie Thr Ser Trp Gly Ser Glu 
195 200 205 
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Pro Cys Ala Leu Pro Glu Arg Pro Ser Leu Tyr Thr Lys Val Val His 
210 215 220 

Tyr Arg Lys Trp lie Lys Asp Thr lie Val Ala Asn Pro 
225 230 235 

PSA 1: human PSA with introns (SEQ ID NO: 2) : 

gtccgtgacg tggattggtg ctgcacccct catcctgtct cggattgtgg 
gaggctggga 60 

gtgcgagaag cattcccaac cctggcaggt gcttgtggcc tctcgtggca gggcagtctg 120 
cggcggtgtt ctggtgcacc cccagtgggt cctcacagct gcccactgca tcaggaacaa 180 
aagcgtgatc ttgctgggtc ggcacagcct gtttcatcct gaagacacag gccaggtatt 240 
tcaggtcagc cacagcttcc cacacccgct ctacgatatg agcctcctga agaatcgatt 300 
cctcaggcca ggtgatgact ccagccacga cctcatgctg ctccgcctgt cagagcctgc 3 60 
cgagctcacg gatgctgtga aggtcatgga cctgcccacc caggagccag cactggggac 420 
cacctgctac gcctcaggct ggggcagcat tgaaccagag gagttcttga ccccaaagaa 480 
acttcagtgt gtggacctcc atgttatttc caatgacgt'g tgtgcgcaag ttcaccctca 540 
gaaggtgacc aagttcatgc tgtgtgctgg acgctggaca gggggcaaaa gcacctgctc 600 
gggtgattct gggggcccac ttgtctgtaa tggtgtgctt caaggtatca cgtcatgggg 660 
cagtgaacca tgtgccctgc ccgaaaggcc ttccctgtac accaaggtgg tgcattaccg 720 
gaagtggatc aaggacacca tcgtggccaa cccctgagca cccctatcaa ccccctattg 780 
tagtaaactt ggaaccttgg aaatgaccag gccaagactc aagcctcccc agttctactg 840 
acctttgtcc ttaggtgtga ggtccagggt tgctaggaaa agaaatcagc agacacaggt 90 0 
gtagaccaga gtgtttctta aatggtgtaa ttttgtcctc tctgtgtcct ggggaatact 96 0 
ggccatgcct ggagacatat cactcaattt ctctgaggac acagatagga tggggtgtct 102 0 
gtgttatttg tggggtacag agatgaaaga ggggtgggat ccacactgag agagtggaga 10 80 
gtgacatgtg ctggacactg tccatgaagc actgagcaga agctggaggc acaacgcacc 114 0 
agacactcac agcaaggatg gagctgaaaa cataacccac tctgtcctgg aggcactggg 12 00 
aagcctagag aaggctgtga gccaaggagg gagggtcttc ctttggcatg ggatggggat 126 0 
gaagtaagga gagggactgg accccctgga agctgattca ctatgggggg aggtgtattg 1320 
aagtcctcca gacaaccctc agatttgatg atttcctagt agaactcaca gaaataaaga 13 80 
gctgttatac tgtg 13 94 

2* PSA 2: SEQ ID NO:l, comprising one or more or any combination of 
Thr4 0, Met 112, and/or deletion of one or more of Tyr 22 5 , Arg226, 
Lys227 # Trp228 r Ile229, Lys230, Asp231, Thr232 r Ile233, Val234, 
Ala235 f Asn236, Pro237. 

3. PSA 3: cDNA sequence with introns (SEQ ID NO: 3) : 

aagtttccct tctcccagtc caagacccca aatcaccaca aaggacccaa tccccagact 61 

caagatatgg tctgggcgct gtcttgtgtc tcctaccctg atccctgggt tcaactctgc 121 

tcccagagca tgaagcctct ccaccagcac cagccaccaa cctgcaaacc tagggaagat 181 

tgacagaatt cccagccttt cccagctccc cctgcccatg tcccaggact cccagccttg 241 

gttctctgcc cccgtgtctt ttcaaaccca catcctaaat ccatctccta tccgagtccc 301 

ccagttcctc ctgtcaaccc tgattcccct gatctagcac cccctctgca ggtgctgcac 361 

ccctcatcct gtctcggatt gtgggaggct gggagtgcga gaagcattcc caaccctggc 421 

aggtgcttgt agcctctcgt ggcagggcag tctgcggcgg tgttctggtg cacccccagt 481 

gggtcctcac agctacccac tgcatcagga acaaaagcgt gatcttgctg ggtcggcaca 541 

gcctgtttca tcctgaagac acaggccagg tatttcaggt cagccacagc ttcccacacc 601 

cgctctacga tatgagcctc ctgaagaatc gattcctcag gccaggtgat gactccagcc 661 

acgacctcat gctgctccgc ctgtcagagc ctgccgagct cacggatgct atgaaggtca 721 

tggacctgcc cacccaggag ccagcactgg ggaccacctg ctacgcctca ggctggggca 781 

gcattgaacc agaggagttc ttgaccccaa agaaacttca gtgtgtggac ctccatgtta 841 

tttccaatga cgtgtgtgcg caagttcacc ctcagaaggt gaccaagttc atgctgtgtg 9 01 

ctggacgctg gacagggggc aaaagcacct gctcgggtga ttctgggggc ccacttgtct 961 

gtaatggtgt gcttcaaggt atcacgtcat ggggcagtga accatgtgcc ctgcccgaaa 1021 

ggccttccct gtacaccaag gtggtgcatt accggaagtg gatcaaggac accatcgtgg 1081 
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ccaacccctg agcaccccta tcaacfcccct attgtagtaa acttggaacc ttggaaatga 1141 
ccaggccaag actcaggcct ccceagttct actgaccttt gtccttaggt gtgaggtcca 1201 
gggttgctag gaaaagaaat cagcagacac aggtgtagac cagagtgttt cttaaatggt 12 61 
gtaattttgt cctctctgtg tcctggggaa tactggccafc gcctggagac atatcactca 13 21 
atfctctctga ggacacagat aggatggggt gtctgtgtta tttgtggggt acagagatga 13 81 
aagaggggtg ggatccacac tgagagagtg gagagtgaca tgtgctggac actgtccatg 1441 
aagcactgag cagaagctgg aggcacaacg caccagacac tcacagcaag gatggagctg 15 01 
aaaacataac ccactctgtc ctggaggcac tgggaagcct agagaaggct gtgaaccaag 15 61 
gagggagggt cttcctttgg catgggatgg ggatgaagta aggagaggga ctgaccccct 1621 
ggaagctgat tcactatggg gggaggtgta ttgaagtcct ccagacaacc ctcagatttg 16 81 
atgatttcct agtagaactc acagaaataa agagctgtta tactgtgaa 

3. rhesus macaque PSA (SEQ ID NO: 4) : 



He Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro Trp Gin Val 
15 10 15 

Leu Val Ala Ser Arg Gly Arg Ala Val Cys Gly Gly Val Leu Val His 

20 25 30 

Pro Gin Trp Val Leu Thr Ala Ala His Cys He Arg Ser Asn Ser Val 
35 40 45 

lie Leu Leu Gly Arg His Asn Pro Tyr Tyr Pro Glu Asp Thr Gly Gin 
50 55 60 

Val Phe Gin Val Ser His Ser Phe Pro His Pro Leu Tyr Asn Met Ser 
65 70 75 80 

Leu Leu Lys Asn Arg Tyr Leu Gly Pro Gly Asp Asp Ser Ser His Asp 

85 90 95 

Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Glu He Thr Asp Ala Val 

100 105 110 

Gin Val Leu Asp Leu Pro Thr Trp Glu Pro Glu Leu Gly Thr Thr Cys 
115 120 125 

Tyr Ala Ser Gly Trp Gly Ser He Glu Pro Glu Glu His Leu Thr Pro 
130 135 140 

Lys Lys Leu Gin Cys Val Asp Leu His He He Ser Asn Asp Val Cys 
145 150 155 160 

Ala Gin Val His Ser Gin Lys Val Thr Lys Phe Met Leu Cys Ala Gly 

165 170 175 

Ser Trp Met Gly Gly Lys Ser Thr Cys Ser Gly Asp Ser Gly Gly Pro 

180 185 190 

Leu Val Cys Asp Gly Val Leu Gin Gly He Thr Ser Trp Gly Ser Gin 
195 200 205 

Pro Cys Ala Leu Pro Arg Arg Pro Ser Leu Tyr Thr Lys Val Val Arg 
210 215 220 
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Tyr Arg Lys Trp lie Gin Asp Thr lie Met Ala Asrx Pro 
225 230 235 

PSA 4: rhesus PSA : SEQ ID NO; 4, comprising one or more or any 
combination of Thr40, Met 112, and/ or deletion of one or more of 
Tyr225, Axg226, Lys227, Trp228, Ile229, Gln230, Asp231, Thr232, 
110233, Met234, Ala235, Asn236, Pro237 . 



4, CTtt epitopes from PSA 



PSA antigen SEQ ID NO: 5: 

Phe Leu Thr Pro Lys Lys Leu Gin Cys Val 
15 10 

PSA antigen SEQ ID NO: 6: 

Lys Leu Gin Cys Val Asp Leu His Val 

1 5 

PSA antigen SEQ ID NO; 7 

Val lie Ser Asn Asp Val Cys Ala Gin Val 
15 10 

PSA antigen SEQ ID NO: 8 

Val Leu Val His Pro Gin Trp Val Leu 
1 5 



PSA antigen SEQ ID NO : 9 

Gin Val His Pro Gin Lys Val Thr Lys . 
1 5 

5, PSA antigen SEQ ID NO: 10: 

Val Val Phe Leu Thr Leu Ser Val Thr Trp lie Gly Ala Ala Pro Leu 
15 10 15 

lie Leu Ser Arg lie Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin 

20 25 30 

Pro Trp Gin Val Leu Val Ala Ser Arg Gly Arg Ala Val Cys Gly Gly 

35 40 45 

Val Leu Val His Pro Gin Trp Val Leu Thr Ala Ala His Cys lie Arg 
50 55 60 

Asn Lys Ser Val lie Leu Leu Gly Arg His Ser Leu Phe His Pro Glu 
65 70 75 80 

Asp Thr Gly Gin Val Phe Gin Val Ser His Ser Phe Pro His Pro Leu 

85 90 95 

Tyr Asp Met Ser Leu Leu Lys Asn Arg Phe Leu Arg Pro Gly Asp Asp 
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100 105 110 

Ser Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Glu Leu 
115 120 125 

Thr Asp Ala Val Lys Val Met Asp Leu Pro Thr Gin Glu Pro Ala Leu 
130 135 140 

Gly Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser lie Glu Pro Glu Glu 
145 150 155 160 

Phe Leu Thr Pro Lys Lys Leu Gin Cys Val Asp Leu His Val He Ser 

165 170 175 

Asn Asp Val Cys Ala Gin Val His Pro Gin Lys Val Thr Lys Phe Met 

180 185 190 

-f 

Leu Cys Ala Gly Arg Trp Thr Gly Gly Lys Ser Thr Cys Ser Trp Val 
195 200 205 

lie Leu He Thr Glu Leu Thr Met Pro Ala Leu Pro Met Val Leu His 
210 215 220 

Gly Ser Leu Val Pro Trp Arg Gly Gly Val 
225 230 



PSA CDNA {SEQ ID NO: 11) 

ggttgtcttc ctcaccctgt ccgtgacgtg gattggtgct gcacccctca tcctgtctcg 60 

gattgtggga ggctgggagt gcgagaagca ttcccaaccc tggcaggtgc ttgtggcctc 12 0 

tcgtggcagg gcagtctgcg gcggtgttct ggtgcacccc cagtgggtcc tcacagctgc ISO 
ccactgcatc aggaacaaaa gcgtgatctt gctgggtcgg cacagcctgt ttcatcctga 24 0 
agacacaggc caggtatttc aggtcagcca cagcttccca cacccgctct acgatatgag 300 

cctcctgaag aatcgattcc tcaggccagg tgatgactcc agccacgacc tcatgctgct 3 60 
ccgcctgtca gagcctgccg agctcacgga tgctgtgaag gtcatggacc tgcccaccca 420 

ggagccagca ctggggacca cctgctacgc ctcaggctgg ggcagcattg aaccagagga 4 80 

gttcttgacc ccaaagaaac ttcagtgtgt ggacctccat gttatttcca atgacgtgtg 540 

tgcgcaagtt caccctcaga aggtgaccaa gfctcatgctg tgtgctggac gctggacagg 6 00 

gggcaaaagc acctgctcgt gggtcattcfc gatcaccgaa ctgaccatgc cagccctgcc 660 

gatggtcctc catggctccc tagtgccctg gagaggaggt gtctagtcag agagtagtcc 720 

tggaaggtgg cctctgtgag gagccacggg gacagcatcc tgcagatggt cctggccctt 780 

gtcccaccga cctgtctaca aggactgtcc tcgtggaccc tcccctctgc acaggagctg 84 0 

gaccctgaag tcccttccct accggccagg actggagccc ctacccctct gttggaatcc 900 

ctgcccacct tcttctggaa gtcggctctg gagacatttc tctcttcttc caaagctggg 960 

aactgctatc tgttatctgc ctgtccaggt ctgaaagata ggattgccca ggcagaaact 1020 

gggactgacc tatctcactc tctccctgct tttaccctta gggtgattct gggggcccac 10 80 

ttgtctgtaa tggtgtgctt caaggtatca cgtcatgggg cagtgaacca tgtgccctgc 1140 

ccgaaaggcc ttccctgtac accaaggtgg tgcattaccg gaagtggatc aaggacacca 1200 

tcgtggccaa cccctgagca cccctatcaa ctccctattg tagtaaactt ggaaccttgg 1260 

aaatgaccag gccaagactc aagcctcccc agttctactg acctttgtcc ttaggtgtga 132 0 

ggtccagggt tgctaggaaa agaaatcagc agacacaggt gtagaccaga gtgtttctta 1380 

aatggtgtaa ttttgtcctc tctgtgtcct ggggaatact ggccatgcct ggagacatat 144 0 

cactcaattt ctctgaggac acagatagga tgggttgtct gtgttatttg tggggtacag 150 0 

agatgaaaga ggggtgggga tccacactga gagagtggag agtgacatgt gctggacact 156 0 

gtccatgaag cactgagcag aagctggagg cacaacgcac cagacactca cagcaaggat 162 0 

ggagctgaaa acataaccca ctctgtcctg gagg 1654 
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PSA ANTIGEN AA SEQ ID HO: 12 

Val Val Phe Leu Thr Leu Ser Val Thr Trp He Gly Ala Ala Pro Leu 
15 10 15 

He Leu Ser Arg lie Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin 

20 25 30 

Pro Trp Gin Val Leu Val Ala Ser Arg Gly Arg Ala Val Cys Gly Gly 
35 40 45 

Val Leu Val His Pro Gin Trp Val Leu Thr Ala Ala His Cys He Arg 
50 55 60 

Asn Lys Ser Val He Leu Leu Gly Arg His Ser Leu Phe His Pro Glu 
65 70 75 80 

Asp Thr Gly Gin Val Phe Gin Val Ser His Ser Phe Pro. His Pro Leu 

85 90 95 

Tyr Asp Met Ser Leu Leu Lys Asn Arg Phe Leu Arg Pro Gly Asp Asp 

100 105 110 

Ser Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Glu Leu 
115 120 - 125 

Thr Asp Ala Val Lys Val Met Asp Leu Pro Thr Gin Glu Pro Ala Leu 
130 135 140 

Gly Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser lie Glu Pro Glu Glu 
145 150 155 160 

Cys Thr Pro Gly Pro Asp Gly Ala Ala Gly Ser Pro Asp Ala Trp Val 

165 170 175 

PSA ANTIGEN DNA SEQ ID NO : 13 

ggttgtcttc ctcaccctgt ccgtgacgtg gattggtgct gcacccctca 
tcctgtctcg 60 

gattgtggga ggctgggagt gcgagaagca ttcccaaccc tggcaggtgc 
ttgtggcctc 12 0 

tcgtggcagg gcagtctgcg gcggtgttct ggtgcacccc cagtgggtcc 
tcacagctgc 180 

ccactgcatc aggaacaaaa gcgtgatctt gctgggtcgg cacagcctgt 
ttcatcctga 240 

agacacaggc caggtatttc aggtcagcca cagcttccca cacccgctct 
acgatatgag 3 00 

cctcctgaag aatcgattcc tcaggccagg tgatgactcc agccacgacc 
tcatgctgct 360 

ccgcctgtca gagcctgccg agctcacgga tgctgtgaag gtcatggacc 
tgcccaccca 420 

ggagccagca ctggggacca cctgctacgc ctcaggctgg ggcagcattg 
aaccagagga 48 0 

gtgtacgcct gggccagatg gtgcagccgg gagcccagat gcctgggtct 
gagggaggag 54 0 



24 



WO 03/031569 



PCT/US02/29640 



gggacaggac tcctgggtct gagggaggag ggccaaggaa ccaggtgggg 
tccagcccac 600 

aacagtgttt tttgcctggc ccgtagtctt gaccccaaag aaacttcagt gtgtggac 
6SB 



PSA ANTIGEN AA SEQ ID NO: 14 

Val Val Phe Leu Thr Leu Ser Val Thr Trp lie Gly Ala Ala Pro Leu 
15 10 15 

lie Leu Ser Arg lie Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin 

20 25 30 

Pro Trp Gin Val Leu Val Ala Ser Arg Gly Arg Ala Val Cys Gly Gly 
35 40 45 

Val Leu Val His Pro Gin Trp Val Leu Thr Ala Ala His Cys lie Arg 
50 55 60 

Asn Lya Ser Val lie Leu Leu Gly Arg His Ser Leu Phe His Pro Glu 
65 70 75 80 

Asp Thr Gly Gin Val Phe Gin Val Ser His Ser Phe Pro His Pro Leu 

85 90 95 

Tyr Asp Met Ser Leu Leu Lys Asn Arg Phe Leu Arg Pro Gly Asp Asp 

100 105 110 

Ser Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Glu Leu 
115 120 125 

Thr Asp Ala Val Lys Val Met Asp Leu Pro Thr Gin Glu Pro Ala Leu 
130 135 140 

Gly Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser lie Glu Pro Glu Glu 
145 150 155 160 

Cys Thr Pro Gly Pro Asp Gly Ala Ala Gly Ser Pro Asp Ala Trp Val 

165 170 175 



PSA ANTIGEN AA SEQ ID NO: 15 



lie Val Gly Gly 
1 

Leu Val Ala Ser 

20 

Pro Gin Trp Val 
35 

Asp Ser Ser His 
50 

Leu Thr Asp Ala 
65 



Trp Glu Cys Glu 

5 

Arg Gly Arg Ala 



Leu Thr Ala Ala 

40 

Asp Leu Met Leu 
55 

Val Lys Val Met 
70 



Lys His Ser Gin 
10 

Val Cys Gly Gly 
25 

His Cys lie Arg 



Leu Arg Leu Ser 

60 

Asp Leu Pro Thr 
75 



Pro Trp Gin Val 
15 

Val Leu Val His 
30 

Lys Pro Gly Asp 
45 

Glu Pro Ala Glu 



Gin Glu Pro Ala 

ao 
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Leu Gly Tlir Thr 



Glu Phe Leu Thr 

100 

Ser Asn Asp Val 
115 

Met Leu Cys Ala 
13 0 

Asp Ser Gly Gly 
145 

Ser Trp Gly Ser 



Thr Lys Val Val 

180 



Cys Tyr Ala Ser 
85 

Pro Lys Lys Leu 



Cys Ala Gin Val 

12 0 

Gly Arg Trp Thr 
135 

Pro Leu Val Cys 
150 

Glu Pro Cys Ala 
165 

His Tyr Arg Lys 



Gly Trp Gly Ser 
90 

Gin Cys Val Asp 
105 

His Pro Gin Lys 



Gly Gly Lys Ser 

140 

Asn Gly Val Leu 
155 

Leu Pro Glu Arg 
170 

Trp lie Lys Asp 
185 



lie Glu Pro Glu 
95 

Leu His Val lie 
110 

Val Thr Lys Phe 
125 

Thr Cys Ser Gly 



Gin Gly lie Thr 

160 

Pro Ser Leu Tyr 
175 

Thr lie Val Ala 
190 



Asn Pro 

XI. KLK2 sequences 
KLK2 AA SEQ ID NO: 16 

lie Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro Trp Gin Val 
1 5 10 15 

Ala Val Tyr Ser Hie Gly Trp Ala His Cys Gly Gly Val Leu Val His 

20 25 30 

Pro Gin Trp Val Leu Thr Ala Ala His Cys Leu Lys Lys Asn Ser Gin 
35 40 45 

Val Trp Leu Gly Arg His Asn Leu Phe Glu Pro Glu Asp Thr Gly Gin 
50 55 60 

Arg Val Pro Val Ser His Ser Phe Pro His Pro Leu Tyr Asn Met Ser 
65 70 75 80 

Leu Leu Lys His Gin Ser Leu Arg Pro Asp Glu Asp Ser Ser His Asp 

85 90 95 

Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Lys lie Thr Asp Val Val 

100 105 110 

Lys Val Leu Gly Leu Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr Cys 
115 120 125 

Tyr Ala Ser Gly Trp Gly Ser lie Glu Pro Glu Glu Phe Leu Arg Pro 
130 135 140 

Arg Ser Leu Gin Cys Val Ser Leu His Leu Leu Ser Asn Asp Met Cys 
145 150 155 160 



Ala Arg Ala Tyr Ser Glu Lys val Thr Glu Phe Met Leu Cys Ala Gly 
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165 170 175 

Leu Trp Thr Gly Gly Lys Asp Thr Cys Gly Gly Asp Ser Gly Gly Pro 

180 185 190 

Leu Val Cys Asn Gly Val Leu Gin Gly lie Thr Ser Trp Gly Pro Glu 
195 200 205 

Pro Cys Ala Leu Pro Glu Lys Pro Ala Val Tyr Thr Lys Val Val His 
210 215 220 

Tyr Arg Lys Trp lie Lys Asp Thr He Ala Ala Asn Pro 
225 230 235 

KLK2 DNA SEQ ID NO: 17 

gctggafcgtg gtggtgcatg cttgtggtct cagctatcct ggaggctgag acaggagaat 60 
cggttgagtc tgggagttca aggctacagg gagctgcgat cacgccgctg cactccagcc 120 
tgggaaacag agtgagactg tctcagaatt tttttaaaaa agaatcagtg atcatcccaa 180 
cccctgttgc tgttcatcct gagcctgcct tctctggctt tgttccctag atcacatctc 240 
catgatccat aggccctgcc caatctgacc tcacaccgtg ggaatgcctc cagactgatc 300 
tagtatgtgt ggaacagcaa gtgctggctc tccctcccct tccacagctc tgggtgtggg 360 
agggggttgt ccagcctcca gcagcatggg gagggccttg gtcagcatct aggtgccaac 42 0 
agggcaaggg cggggtcctg gagaatgaag gctttatagg gctcctcagg gaggcccccc 480 
agccccaaac tgcaccacct ggccgtggac acctgtgtca gcatgtggga cctggttctc 54 0 
tccatcgcct tgtctgtggg gtgcactggt gagattgggg ggataaagga aggggggcgg 600 
gttctgactc ttatgctgaa gcccttttcc tcccacccag tgccccagcc tcgtcccttc 660 
agcccacagt tcagcccaga caatgtgccc ctgactcttc cacattgcaa tagtcctcat 72 0 
gcccacacta ggtccccgct ccctcccact tacctcagac ctttctctcc attgcccagc 78 0 
caaatccctg ctcccagctg ctttactaaa gagcaagttc ctaggcatct ctgtgtttct 84 0 
ctttatgggg ttcaaaacct ttcaaggacc tctctccatg ccactggttc cttggaccct 90 0 
atcactgggc tgcctcctga gcccctcagt cctaccacag tctactgact tttcccattc 960 
agctgtgagc attcaaccct gtcccctgga ccttgacacc tggctcccca accctgtccc 1020 
aggaaaccca gattccacca gacacttcct tcttcccccc cgaggcfcatc tggcctgaga 1080 
caacaaatgc tgcctcccac cctgagtctg gcactgggac tfctcagaact cctccttccc 1140 
tgactctttg ccccagaccc gtcattcaat ggctagcttt ttccatggga agaagaacaa 1200 
cgagcacccc caaccacaac ggccagttct ctgattccct aaatccgcac ccttttcaaa 1260 
acctcaaaaa caaaacaaaa caaaacaaag caagaaacaa ctcaggcaaa acttgttgct 13 2 0 
taaccttgga catggtaaac catccaaaac cttcctctcc cagcaactaa acctctccac 13 80 
tgggcactta acctttggtt tcttggaacc tcttaatctc ttagaaccca cagctgccac 144 0 
cacatgccct tctcccaatg taagacccca aatcactcca aatgacccaa cccccaaccc 1500 
atgcctcctt cagatatttc ccatgtcccc tactctgatc tctggggtca gctccgttct 1560 
cgagagcatg aagcctcccg acctggtcca gccaccaacc cgctaacgca gggaatagct 162 0 
acagaattgc cagccctccc aggacccctt gcttgtgtcc tggactccca gtcctggtcc 1680 
tctgccccca tgtctcttca aacccacagc tcagctccct cccctatcca attcttttgg 1740 
gtctgatccc cctgacccag caccccctcc gcaggtgccg tgcccctcat ccagtctcgg 18 00 
attgtgggag gctgggagtg tgagaagcat tcccaaccct ggcaggtggc tgtgtacagt 18 60 
catggatggg cacactgtgg gggtgtcctg gtgcaccccc agtgggtgct cacagctgcc 1920 
cattgcctaa agaagtaagt aggaccctgg gatctgggga gggaatggct gtgtcccaca 1980 
ggaataacag cgggatgctt cccccagggt cacttctcag gtgaggcttc agactaaagg 2 040 
agagagggaa ggtcctggcc caggtcgcac ccggaggcag agctggggct ggaccactct 2100 
ccccatggct gcctgggttt ctctctgtgt ctgatctcgc tgtgtctctt ggtatctggc 2160 
tctggttgtg tctgtatgac tgtgttttgg tctctatgtc cctctctctt ttctgtctcc 2220 
ctgtgtctgt gtctcccccg tctcfcgtctc tgggtctctc tgtggccatc tctgtcaccg 2280 
tgtgtctcac cctgcatctc tttgcctgtc tttctctctg ggtctctgcc tcagcccttc 2340 
ctcatcacta ctgaacacac cccgtgaggt gggtggggag cacccagaaa aaggaaggac 2400 
tttaagctca atgtgtgtgc atgtgagggg gtgcctgtca ttgcacagca ctctctgcag 2460 
gacatccctc caccctgggg agacacaggg agggctggtt tcagctgtag ctgggtgcac 252 0 
agttgaggag ggaggaagga gaaggggaaa caagaaagga ggggaaggtg gccgggcacg 258 0 
gtggcccacg cctgtaatcc cagcactttg ggaggccgag gtgggtggat catctgaggt 264 0 
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caggagtttg aaaccagcct ggccaacatg gcaaaacccc gtctctacta aaaatacaaa 2700 

aagtagccag gcgtggtgct gcgcgcctgt aatccaatta ctagggaggc tgaggcagga 2760 

gaatcgcttg aacccgggag gcagaggttg cagtgagccg agatcgtgcc actgcactcc 282 0 

agcctgggtg acagagcaag actccatctc agaaaaaaca aacaaacaaa caaacaacaa 2880 

aaaaaatcga aaggagggga agggagctgg agagagaaag ggggacatgg ccctgagctg 2 940 

tgggccgggc cacccgccac tacagagccc tcactccagc cccagctgca ggtgagccac 3000 

cctcatgcct ctcctcctcc ccctgctact ccacactcct cagatgcccc cgtggcctcc 3060 

ctcctttttc tctcccacac tgtatcaccc ctggcttcct ctctgctgtt tctccttctc 312 0 

tctctgactt cccgcatcct tttctcattt gtctatttct cactcccttc ctggttctgt 3180 

tctttctccc ttcctcttcc ccatgtctat ttcttgctgt ctctgtctct tctttgctca 3240 

tcctaattct cactgttctc ccttctgttt ttgtcattcc tctgccattt tatgctctct 33 00 

cttttccact tcgtttcttt cagtttctgt ctctgcctct cacatgatca cactcctgtt 3 360 

ttctaactca ctgtctgtat ttcaccacga ctatatctcc ccgacccctg tgcttttctc 3420 

actgtttctt tttcttccct ttggagtctc ccttatcctc ccctgcccca tctacctttc 3480 

cccattttct ctctcctcat gcatccaccc ccttcctccc caggaatagc caggtctggc 354 0 

tgggtcggca caacctgttt gagcctgaag acacaggcca gagggtccct gtcagccaca 3600 

gcttcccaca cccgctctac aatatgagcc ttctgaagca tcaaagcctt agaccagatg 3660 

aagactccag ccatgacctc atgctgctcc gcctgtcaga gcctgccaag atcacagatg 372 0 

ttgtgaaggt cctgggcctg cccacccagg agccagcact ggggaccacc tgctacgcct 3780 

caggctgggg cagcatcgaa ccagaggagt gtacgcctgg gccagatggt gtagctggga 3 84 0 

gcccagatgc ctgggtctga gggaagtggg gccaaagaac caggtggggt ccggccacag 3 90 0 

cccagttttt ctctgaccca tagtcttgcg ccccaggagt ctfccagtgtg tgagcctcca 3960 

tctcctgtcc aatgacatgt gtgctagagc ttactctgag aaggtgacag agttcatgtt 4020 

gtgtgctggg ctctggacag gtggtaaaga cacttgtggg gtgagtcatc cctactccca 408 0 

acatctggag gggaaaggtg agtgaagacc ctaattctgg gctgcaatct gaaagctaac 414 0 

cagacatctg cctcccctgc tccccagcta tagccacgcc ccctccccat gcctcatctg 42 00 

ccgccctcct tcccccttcc ctgactccct caacacaaga ggtgattctc acagcataat 42 6 0 

tcacccattc ctgtgttgag cacatgctta ctgggcactg ctacgtgacc agcattgccg 432 0 

tagaccctgg gaagcagcag tgaacaggta gagagcagcc tctccctcct gcagccccca 438 0 

tgctggtgag gggcactggc aggaacagtg gacccaacat ggaaatgctg gagggtgtca 444 0 

ggaagtgatc gggctctggg gcagggagga ggggtgggga gtgtcactgg gaggggacat 45 00 

cctgcagaag gtaggagtga gcaaacaccc gctgcagggg aggggagagc cctgcggcac 45 60 

ctgggggagc agagggagca gcacctgccc aggcctggga ggaggggccg ggagggcgtg 462 0 

aggaggagcg agggggctgc atggctggag tgagggatca ggggcagggc gcgagatggc 468 0 

ctcacacagg gaagagaggg cccctcctgc agggcctcac ctgggccaca ggaggacact 474 0 

gcttttcctc tgaggagtca ggagctgtgg atggtgctgg acagaagaag gacagggcct 4800 

ggctcaggtg tccagaggct gtcgctggct tccctttggg atcagactgc agggagggag 4860 

ggcggcaggg ttgtgggggg agtgacgatg aggatgacct gggggtggct ccaggccttg 492 0 

cccctgcctg ggccctcacc cagcctccct cacagtctcc tggccctcca gtctctcccc 498 0 

tccactccat cctccatctg gcctcagtgg gtcattctga tcactgaact gaccataccc 504 0 

agccctgccc acggccctcc atggctcccc aatgccctgg agaggggaca tctagtcaga 510 0 

gagtagtcct gaagaggtgg cctctgcgat gtgcctgtgg gggcagcaac ctgcagatgg 516 0 

tcccggccct catcctgctg acctgtctgc agggatgtcc tcctggacct tgcccctgtg 5220 

caggagctgg accctgaagt cccctcccca taggccaaga ctggagcctt gttccctctg 52 80 

ttggactccc tgcccatatt cttgtgggag tgggttctgg agacatttct gtctgttcct 5340 

gagagctggg aattgctctc agtcatctgc ctgcgcggtt ctgagagatg gagttgccta 5400 

ggcagttatt ggggccaatc tttctcactg tgtctctcct cctttaccct fcagggtgatt 5460 

ctgggggtcc acttgtctgt aatggggtgc ttcaaggtat cacatcatgg ggccctgagc 5520 

catgtgccct gcctgaaaag cctgctgtgt acaccaaggt ggtgcattac cggaagtgga 5580 

tcaaggacac catcgcagcc aacccctgag tgcccctgtc ccacccctac ctctagtaaa 5640 

tttaagtcca cctcacgttc tggcatcact tggcctttct ggatgctgga cacctgaagc 5700 

ttggaactca cctggccgaa gctcgagcct cctgagtcct actgacctgt gctttctggt 5760 

gtggagtcca gggctgctag gaaaaggaat gggcagacac aggtgtatgc caatgtttct 5 820 

gaaatgggta taatttcgtc ctctccttcg gaacactggc tgtctctgaa gacttctcgc 5880 

tcagtttcag tgaggacaca cacaaagacg tgggtgacca tgttgtttgt ggggtgcaga 5 940 

gatgggaggg gtggggccca cctggaagag tggacagtga cacaaggtgg acactctcta 6 000 

cagatcactg aggataagct ggagccacaa tgcafcgaggc acacacacag caaggatgac 6 060 

gctgtaaaca tagcccacgc tgtcctgggg gcactgggaa gcctagataa ggccgtgagc 6120 
agaaagaagg ggaggatcc 6139 
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human KLK2 AA SEQ ID NO: 18 

lie Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro Trp Gin Val 
1 5 10 15 

Ala Val Tyr Ser His Gly Trp Ala His Cys Gly Gly Val Leu Val His 

20 25 30 

Pro Gin Trp Val Leu Thr Ala Ala His Cys Leu Lys Lys Asn Ser Gin 
35 40 45 

Val Trp Leu Gly Arg His Asn Leu Phe Glu Pro Glu Asp Thr Gly Gin 
50 55 60 

Arg Val Pro Val Ser His Ser Phe Pro His Pro Leu Tyr Asn Met Ser 
65 70 75 80 

Leu Leu Lys His Gin Ser Leu Arg Pro Asp Glu Asp Ser Ser His Asp 

85 SO SB 

Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Lys lie Thr Asp Val Val 

100 105 110 

Lys Val Leu Gly Leu Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr Cys 
115 120 125 

Tyr Ala Ser Gly Trp Gly Ser lie Glu Pro Glu Glu 
130 135 140 

human KLK2 AA SEQ ZD NO* 19 

He Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro Trp Gin Val 
1 5 10 15 

Ala Val Tyr Ser His Gly Trp Ala His Cys Gly Gly Val Leu Val His 

20 25 30 

Pro Gin Trp Val Leu Thr Ala Ala His Cys Leu Lys Lys Asn Ser Gin 
35 40 45 

Val Trp Leu Gly Arg His Asn Leu Phe Glu Pro Glu Asp Thr Gly Gin 
50 55 60 

Arg Val Pro Val Ser His Ser Phe Pro His Pro Leu Tyr Asn Met Ser 
65 70 75 80 

Leu Leu Lys His Gin Ser Leu Arg Pro Asp Glu Asp Ser Ser His Asp 

85 90 95 

Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Lys He Thr Asp Val Val 

100 105 110 

Lys Val Leu Gly Leu Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr Cys 
115 120 125 

Tyr Ala Ser Gly Trp Gly Ser lie Glu Pro Glu Glu Phe Leu Arg Pro 
130 135 140 
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Arg Ser Leu Gin Cys Val Ser Leu His Leu Leu Ser Asn Asp Met Cys 
145 150 155 160 

Ala Arg Ala Tyr Ser Glu Lys Val Thr Glu Phe Met Leu Cys Ala Gly 

165 170 175 

Leu Trp Thr Gly Gly Lys Asp Thr Cys Gly Val Ser His Pro Tyr Ser 

180 185 190 

Gin His Leu Glu Gly Lys Gly 
195 

III. MUC1 Sequences 

human MUC1 AA: (SEQ ID NO: 20) 

Met Thr Pro Gly Thr Gin Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 

1 .5 .10 15 

Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly 

20 25 30 

Gly Glu Lys Glu Thr Ser Ala Thr Gin Arg Ser Ser Val Pro Ser Ser 
35 40 45 

Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His 
50 55 60 

Ser Pro Gly Ser Gly Ser Ser Thr Thr Gin Gly Gin Asp Val Thr Leu 
65 70 75 SO 

Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gin 

85 90 95 



Asp Val Thr Ser 

100 

Pro Pro Ala His 
115 

Gly Ser Thr Ala 
130 

Arg Pro Pro Pro 
145 

Ala Pro Asp Thr 



Gly Val Thr Ser 

180 

Pro Pro Ala His 
195 

Ala Ser Thr Ala 
210 

Ala Ser Gly Ser 
225 



Val Pro Val Thr 



Asp Val Thr Ser 

120 

Pro Pro Ala His 
135 

Gly Ser Thr Ala 
150 

Arg Pro Pro Pro 
165 

Ala Pro Asp Thr 



Gly Val Thr Ser 

200 

Pro Pro Val His 
215 

Ala Ser Thr Leu 
230 



Arg Pro Ala Leu 
105 

Ala Pro Asp Asn 



Gly Val Thr Ser 

14 0 

Pro Pro Ala His 
155 

Gly Ser Thr Ala 
170 

Arg Pro Ala Pro 
185 

Ala Pro Asp Asn 



Asn Val Thr Ser 

220 

Val His Asn Gly 
235 

30 



Gly Ser Thr Thr 
110 

Lys Pro Ala Pro 
125 

Ala Pro Asp Thr 



Gly Val Thr Ser 

160 

Pro Ala Ala His 
175 

Gly Ser Thr Ala 
190 

Arg Pro Ala Leu 
205 

Ala Ser Gly Ser 



Thr Ser Ala Arg 
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Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe Ser lie Pro Ser 

245 250 255 

His His Ser Asp Thr Pro Thr Thr Leu Ala Ser His Ser Thr Lys Thr 

260 265 270 

Asp Ala Ser Ser Thr His His Ser Thr Val Pro Pro Leu Thr Ser Ser 
275 280 285 

Asn His Ser Thr Ser Pro Gin Leu Ser Thr Gly val Ser Phe Phe Phe 
290 295 300 

Leu Ser Phe His lie Ser Asn Leu Gin Phe Asn Ser Ser Leu Glu Asp 
305 310 315 320 

Pro Ser Thr Asp Tyr Tyr Gin Glu Leu Gin Arg Asp lie Ser Glu Met 

325 330 335 

Phe Leu Gin lie Tyr Lys Gin Gly Gly Phe Leu Gly Leu Ser Asn lie 

340 345 350 



Lys Phe Arg Pro 
355 

Glu Gly Thr He 
370 

Lys Thr Glu Ala 
385 

Val Ser Asp Val 



Pro Gly Trp Gly 

420 

Leu Ala lie Val 
435 

Lys Asn Tyr Gly 
450 

Pro Met Ser Glu 
465 

Pro Ser Ser Thr 



Gly Gly Ser Ser 

500 



Gly Ser Val Val 

360 

Asn Val His Asp 
375 

Ala Ser Arg Tyr 
390 

Pro Phe Pro Phe 
405 

lie Ala Leu Leu 



Tyr Leu He Ala 

440 

Gin Leu Asp He 

455 

Tyr Pro Thr Tyr 
470 

Asp Arg Ser Pro 
485 

Leu Ser Tyr Thr 



Val Gin Leu Thr 



Val Glu Thr Gin 

380 

Asn Leu Thr He 
3 95 

Ser Ala Gin Ser 
410 

Val Leu Val Cys 
425 

Leu Ala Val Cys 



Phe Pro Ala Arg 

460 

His Thr His Gly 
475 

Tyr Glu Lys Val 
490 

Asn Pro Ala Val 
505 



Leu Ala Phe Arg 
365 

Phe Asn Gin Tyr 



Ser Asp Val Ser 

400 

Gly Ala Gly Val 
415 

Val Leu Val Ala 
430 

Gin Cys Arg Arg 
445 

Asp Thr Tyr His 



Arg Tyr Val Pro 

480 

Ser Ala Gly Asn 
495 

Ala Ala Thr Ser 
510 



Ala Asn Leu 
515 



MUC1 DNA sequence; {SEQ ID NOt21) 

gaattccctg gctgcttgaa tctgttctgc cccctcccca cccatttcac caccaccatg 60 
acaccgggca cccagtctcc tttcttcctg ctgctgctcc tcacagtgct tacagttgtt 120 
acaggttctg gtcatgcaag ctctacccca ggtggagaaa aggagacttc ggctacccag 180 
agaagttcag tgcccagctc tactgagaag aatgctgtga gtatgaccag cagcgtactc 240 
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tccagccaca gccccggttc aggctcctcc accactcagg gacaggatgt cactctggcc 3 00 
ccggccacgg aaccagcttc aggttcagct gccacctggg gacaggatgt cacctcggtc 36 0 
ccagtcacca ggccagccct gggctccacc accccgccag cccacgatgt cacctcagcc 42 0 
ccggacaaca agccagcccc gggctccacc gcccccccag cccacggtgt cacctcggcc 480 
ccggacacca ggccgccccc gggctccacc gcccccccag cccacggtgt cacctcggcc 540 
ccggacacca ggccgccccc gggctccacc gcgcccgcag cccacggtgt cacctcggcc 60 0 
ccggacacca ggccggcccc gggctccacc gcccccccag cccatggtgt cacctcggcc 660 
ccggacaaca ggcccgcctt ggcgtccacc gcccctccag tccacaatgt cacctcggcc 72 0 
tcaggctctg catcaggctc agcttctact ctggtgcaca acggcacctc tgccagggct 780 
accacaaccc cagccagcaa gagcactcca ttctcaattc ccagccacca ctctgatact 840 
cctaccaccc ttgccagcca tagcaccaag actgatgcca gtagcactca ccatagcacg 900 
gtacctcctc tcacctcctc caatcacagc acttctcccc agttgtctac tggggtctct 960 
ttctttttcc tgtcttttca catttcaaac ctccagttta attcctctct ggaagatccc 1020 
agcaccgact actaccaaga gctgcagaga gacatttctg aaatgttttt gcagatttat 1080 
aaacaagggg gttttctggg cctctccaat attaagttca ggccaggatc tgtggtggta 1140 
caattgactc tggccttccg agaaggtacc atcaatgtcc acgacgtgga gacacagttc 1200 
aatcagtata aaacggaagc agcctctcga tataacctga cgatctcaga cgtcagcgtg 126 0 
agtgatgtgc catttccttt ctctgcccag tctggggctg gggtgccagg ctggggcatc 132 0 
gcgctgctgg tgctggtctg tgttctggtt gcgctggcca ttgtctatct cattgccttg 13 8 0 
gctgtctgtc agtgccgccg aaagaactac gggcagctgg acatctttcc agcccgggat 144 0 
acctaccatc ctatgagcga gtaccccacc taccacaccc atgggcgcta tgtgccccct 15 0 0 
agcagtaccg atcgtagccc ctatgagaag gtttctgcag gtaatggtgg cagcagcctc 15 6 0 
tcttacacaa acccagcagt ggcagccact tctgccaact tgtaggggca cgtcgccctc 162 0 
tgagctgagt ggccagccag tgccattcca ctccactcag ggctctctgg gccagtcctc 168 0 
ctgggagccc ccaccacaac acttcccagg catggaattc c 1721 



1. Complete coding sequence of MtJCl (genomic and protein translation, 
but does not include complete set of tandem repeats , probably- in 
interest of space) 

: (SEQ ID NO: 2 2) 

Met Thr Pro Gly Thr Gin Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 
15 10 15 

val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly 

20 25 30 

Gly Glu Lys Glu Thr Ser Ala Thr Gin Arg Ser Ser Val Pro Ser Ser 
35 40 45 

Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His 
50 55 60 

Ser Pro Gly Ser Gly Ser Ser Thr Thr Gin Gly Gin Asp Val Thr Leu 
65 70 75 80 

Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gin 

85 90 95 

Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr 

100 105 110 

Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro 
115 120 125 

Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr 
130 135 140 
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Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser 
145 ISO 155 160 

Ala Pro Asp Asn Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His 

165 170 175 

Asn Val Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu 

180 185 190 

Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys 
195 200 205 

Ser Thr Pro Phe Ser He Pro Ser His His Ser Asp Thr Pro Thr Thr 
210 215 220 

Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser 
225 230 235 240 

Thr Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gin Leu 

245 250 255 

Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe His He Ser Asn Leu 

260 265 270 

Gin Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gin Glu 
275 260 285 

Leu Gin Arg Asp He Ser Glu Met Phe Leu Gin He Tyr Lys Gin Gly 
290 295 300 

Gly Phe Leu Gly Leu Ser Asn He Lys Phe Arg Pro Gly Ser Val Val 
305 310 315 320 

Val Gin Leu Thr Leu Ala Phe Arg Glu Gly Thr He Asn Val His Asp 

325 330 335 

Val Glu Thr Gin Phe Asn Gin Tyr Lys Thr Glu Ala Ala Ser Arg Tyr 

340 345 350 

Asn Leu Thr lie Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe 
355 360 365 

Ser Ala Gin Ser Gly Ala Gly Val Pro Gly. Trp Gly He Ala Leu Leu 
370 375 380 

Val Leu Val Cys Val Leu Val Ala Leu Ala lie Val Tyr Leu He Ala 
385 390 395 400 

Leu Ala Val Cys Gin Cys Arg Arg Lys Asn Tyr Gly Gin Leu Asp He 

405 410 415 

Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr 

420 425 430 

His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro 
435 440 445 

Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr 
450 455 460 
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Asn Pro Ala Val Ala Ala Thr Ser Ala Asn Leu 
465 470 475 

MUC-1 DNA SEQ ID NO : 23 

gaattcagaa ttttagaccc ttfcggccttg gggtccatcc tggagaccct gaggtctaag 60 
ctacagcccc fccagccaacc acagaccctt cfcctggctcc caaaaggagt tcagtcccag 12 0 
agggtggtca cccacccttc agggatgaga agttttcaag gggtattact caggcactaa 18 0 
ccccaggaaa gatgacagca cattgccata aagttttggt fcgttttctaa gccagtgcaa 24 0 
ctgcttattt tagggatttt ccgggatagg gtggggaagt ggaaggaatc ggcgagtaga 3 00 
agagaaagcc tgggagggtg gaagttaggg atctagggga agtttggctg atttggggat 3 60 
gcgggtgggg gaggtgctgg atggagttaa gtgaaggata gggtgcctga gggaggatgc 42 0 
ccgaagtcct cccagaccca cttactcacg gtggcagcgg cgacactcca gtctatcaaa 4 80 
gatccgccgg gatggagagc caggaggcgg gggctgcccc tgaggtagcg gggaggccgg 54 0 
ggggccgggg ggcggacggg acgagtgcaa tatfcggcggg ggaaaaaaca acactgcacc 60 0 
gcgtcccgtc cctcccgccc gcccgggccc ggatcccgct ccccaccgcc tgaagccggc 660 
ccgacccgga acccgggccg ctggggagtt gggttcacct tggaggccag agagacttgg 7 20 
cgcccggaag caaagggaat ggcaaggggg aggggggagg gagaacggga gtttgcggag 7 80 
tccagaaggc cgctttccga cgcccgggcg ttgcgcgcgc ttgctcttta agtactcaga 84 0 
ctgcgcggcg cgagccgtcc gcatggtgac gcgtgtccca gcaaccgaac tgaatggctg 900 
ttgcttggca atgccgggag ttgaggtttg gggccgccca cctagctact cgtgttttct 960 
ccggcctgcg agfctgggggg ctcccgcctc cccggcccgc tcctgggcgc gctgacgtca 102 0 
gatgtcccca ccccgcccag cgcctgcccc aagggtctcg ccgcacacaa agctcggcct 10 80 
cgggcgccgg cgcgcgggcg agagcggtgg tcfcctcgcct gctgatctga tgcgctccaa 1140 
tcccgtgcct cgccgaagtg tttttaaagt gttctttcca acctgtgtct ttggggctga 1200 
gaactgtttt ctgaatacag gcggaactgc ttccgtcggc ctagaggcac gctgcgactg 1260 
cgggacccaa gttccacgtg ctgccgcggc ctgggatagc ttcctcccct: cgtgcactgc 13 2 0 
tgccgcacac acctcttggc tgtcgcgcat tacgcacctc acgtgtgctt ttgccccccg 1380 
ctacgtgcct acctgtcccc aataccactc tgctccccaa aggatagttc tgtgtccgta 1440 
aatcccattc tgtcacccca cctactctct gcccccccct tttttgtttt gagacggagc 1500 
tttgctctgt cgcccaggct ggagtgcaafc ggcgcgatct cggctcactg caacctccgc 15 60 
ctcccgggtt caagcgattc tcctgcctca gcctcctgag tagctggggt tacagcgccc 162 0 
gccaccacgc tcggctaatt tttgtagttt ttagtagaga cgaggtttca ccatcttggc 1680 
caggctggtc ttgaacccct gaccttgtga tccactcgcc tcggccttcc aaagtgttgg 174 0 
gattacgggc gtgacgaccg tgccacgcat ctgcctctta agtacataac ggcccacaca 1800 
gaacgtgtcc aactcccccg cccacgttcc aacgtcctct cccacatacc tcggtgcccc 1860 
ttccacatac ctcaggaccc cacccgctta gctccatttc ctccagacgc caccaccacg 192 0 
cgtcccggag tgccccctcc taaagctccc agccgtccac catgctgtgc gttcctccct 198 0 
ccctggccac ggcagtgacc cttctctccc gggccctgct tccctctcgc gggctctgct 2 04 0 
gcctcactta ggcagcgctg cccttactcc tctccgcccg gtccgagcgg cccctcagct 2100 
tcggcgccca gccccgcaag gctcccggtg accactagag ggcgggagga gctcctggcc 2160 
agtggtggag agtggcaagg aaggacccta gggttcatcg gagcccaggt ttactccctt 222 0 
aagtggaaat ttcttccccc actcctcctt ggctttctcc aaggagggaa cccaggctgc 2280 
tggaaagtcc ggctgggggg gggactgtgg gttcagggga gaacggggtg tggaacggga 234 0 
cagggagcgg ttagaagggt ggggctattc cgggaagtgg tggggggagg gagcccaaaa 2400 
ctagcaccta gtccactcat tatccagccc tcttatttct cggccgctct gcttcagtgg 24 60 
acccggggag ggcggggaag tggagtggga gacctagggg tgggcttccc gaccttgctg 252 0 
tacaggacct cgacctagct ggctttgttc cccatcccca cgttagttgt tgccctgagg 25 8 0 
ctaaaactag agcccagggg ccccaagttc cagactgccc ctcccccctc ccccggagcc 264 0 
agggagtggt tggtgaaagg gggaggccag ctggagaaca aacgggtagt cagggggttg 2700 
agcgattaga gcccttgtac cctacccagg aatggttggg gaggaggagg aagaggtagg 2760 
aggtagggga gggggcgggg ttttgtcacc tgtcacctgc tcgctgtgcc tagggcgggc 2B2 0 
gggcggggag tggggggacc ggtataaagc ggtaggcgcc tgtgcccgct ccacctctca 2 88 0 
agcagccagc gcctgcctga atctgttctg ccccctcccc acccatttca ccaccaccat 294 0 
gacaccgggc acccagtctc ctttcttcct gctgctgctc ctcacagtgc ttacaggtga 300 0 
ggggcacgag gtggggagtg ggctgccctg cttaggtggt cttcgtggtc tttctgtggg 3060 
ttttgctccc tggcagatgg caccatgaag ttaaggtaag aattgcagac agaggctgcc 312 0 
ctgtctgtgc cagaaggagg gagaggctaa ggacaggctg agaagagttg cccccaaccc 318 0 
tgagagtggg taccaggggc aagcaaatgt cctgtagaga agtctagggg gaagagagta 324 0 
gggagaggga aggcttaaga ggggaagaaa tgcaggggcc atgagccaag gcctatgggc 33 0 0 
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agagagaagg aggctgctgc agggaaggag 
cactccccag tcctcctggt attatttctc 
tcttattttt ccttcataaa gacccaaccc 
cccctaaacc cgcaacagtt gttacaggtt 
aaaaggagac ttcggctacc cagagaagtt 
tgagtatgac cagcagcgta ctctccagcc 
agggacagga tgtcactctg gccccggcca 
ggggacagga tgtcacctcg gtcccagtca 
cagcccacga tgtcacctca gccccggaca 
cagcccacgg tgtcacctcg gccccggaca 
cagcccatgg tgtcacctcg gccccggaca 
cagtccacaa tgtcacctcg gcctcaggct 
acaacggcac ctctgccagg gctaccacaa 
ttcccagcca ccactctgat actcctacca 
ccagtagcac tcaccatagc acggtacctc 
cccagttgtc tactggggtc tctttctttt 
ttaattcctc tctggaagat cccagcaccg 
ctgaaatggt gagtatcggc ctttccttcc 
gtccacaccc tttgcatcaa gcccgagtcc 
ataaacaagg gggtttfcctg ggcctctcca 
gacccagtgt ggtggtfcgga gggttgggtg 
cttaaggttg ggggaagagt get gage cag 
ccctgtgacc aggecaggat ctgtggtggt 
catcaatgtc cacgacgtgg agacacagtt 
atataacctg acgatctcag aegtcagegg 
catgeegggg cccctctcct tccagtgtct 
egggagggge gcctcctctg ggagactgee 
tgtgccattt cctttctctg cccagtctgg 
gcfcggtgctg gtctgtgttc tggttgcgct 
tgcagtccct ggccctgatc agagcccccc 
ctcctatctc cccaggctgt ctgtcagtgc 
tttccagccc gggataccta ccatcctatg 
cgctatgtgc cccctagcag taccgatcgt 
caggecaggg gaagcagagg gtttggctgg 
cccaaagagc ttggaagagg tgagaagtgg 
gatgaggggc agaggtcaga ggagttttgg 
aaggggcetc aagagggagt ggccccactg 
cacattcatg ctggctggcg ctggctgaac 
tgcttttttg cacccagagg caaaatgggt 
aggagtccag gggtgagcct ctgtgatccc 
ccgagaaaag gctggcatag ggggagtcag 
gcagaccagg tgagcgtggg tgccagtggg 
gctccctcct tcctctcctg gtctttctct 
ectgaggefcg gaaaaccact ccaggtgggg 
tttcctcctc ctggagacct ccctctctcg 
cccaaaacac acacacacac acacacacac 
ggagggacaa gggggctgat tagagecaag 
gggcagccct gtttacagtc acctggctgg 
cctttgagag ctggccagga ctctggactg 
ctctaggagg taccttttgc tccfccaccct 
caggtaatgg tggcagcagc ctctcttaca 
acttgtaggg gcacgtcgcc cgctgagctg 
caggttcttc agggecagag cccctgcacc 
tgggctgetc acacgtcctt cagaggcccc 
gaagctcatg tgggcccctg aggctcatgc 
aggac t ggee c agagagee c t gaga t ageg 
gtctcccact ggcgccaact tctgatcttt 
gaatgtgtgt gagggggctg ggggaggaga 
tttgtttgag aagcaggaga tgtgaggagg 
ggagccacct ctggctaacc ctggcagcac 



gcttccaacc caggggttac tgaggctgee 3 36 0 
tggtggccag agcttatatt ttcttcttgc 342 0 
tatgacttta acttcttaca gctaccacag 3480 
ctggtcatgc aagctctacc ccaggtggag 3540 
cagtgcccag ctctactgag aagaatgctg 3600 
acagccccgg ttcaggctcc tccaccactc 3660 
cggaaccagc ttcaggttca gctgccacct 3720 
ccaggccagc cctgggctcc accaccccgc 3 780 
acaagccagc cccgggctcc accgcccccc 3840 
ccaggccggc cccgggctcc accgcccccc 3 900 
acaggcccgc cttgggctcc accgcccctc 3 960 
ctgeatcagg ctcagcttct actctggtgc 4 02 0 
ccccagccag caagagcact ccattctcaa 40 80 
cccttgccag ccatagcacc aagactgatg 414 0 
ctctcacctc ctccaatcac agcacttctc 42 00 
tcctgtcttt tcacatttca aacctccagt 4260 
actactacca agagctgeag agagacattt 4320 
ccatgctccc ctgaagcagc catcagaact 4380 
tttccctctc accccagttt ttgcagattt 4440 
atattaagtt caggtacagt tctgggtgtg 4500 
gtggtcatga cegtaggagg gactggtgca 4560 
agctgggacc cgtggctgaa gtgcccattt 462 0 
acaattgact ctggccttcc gagaaggtac 4680 
caatcagtat aaaacggaag cagcctctcg 4740 
tgaggctact tccctggctg cagccagcac 4800 
gggtccccgc tctttcctta gtgctggcag 4860 
ctgaccactg cttttccttt tagtgagtga 4920 
ggctggggtg ccaggctggg gcatcgcgct 4 98 0 
ggccattgtc tatctcattg ccttggtgag 5 04 0 
ggtagaaggc actccatggc ctgccataac 5100 
cgecgaaaga actaegggea gctggacatc 5160 
agegagtace ccacctacca cacccatggg 5220 
agcccctatg agaaggtgag attggcccca 5280 
gcaaggattc tgaagggggt acttggaaaa 5340 
cgtgaagtga gcaggggagg gcctggcaag 5400 
gggacaggee tgggaggaga ctatggaaga 5460 
ccagaattcc taaaaagatc attggccgtc 5520 
tggtgccacc gtggcagttt tgttttgttt 55 8 0 
ggagcactat geccagggga gcccttcccg 5640 
ctaatcaatc tcctaggaat ggagggtaga 57 00 
tttcccaggt agaagcaaga agaagtgtca 57 60 
gttcttggga gcttcaagga agcaaggaac 5 820 
atgggaccta gtaaataatt actgcagcca 5 880 
gaggagagag tttagttttc ttgetcctat 5940 
gctttacaaa gacacagata caccccgccc 6000 
acacctcctt aggctggaac agcagagaat 6060 
aagagggagt gaaggagagc agagggagga 612 0 
tggggtggca ggtgctctct ctgaattaac 6180 
attaccccag cctggggtgg catccagggg 624 0 
ggatctcttt tccttccacc caggtttctg 63 00 
caaacccagc agtggcagcc acttctgcca 63 60 
agtggccagc cagtgecatt ccactccact 6420 
ctgtttgggc tggtgagctg ggagttcagg 64 80 
accaatttct cggacacttc tcagtgtgtg 6540 
ctgggaagtg ttgtggtggg ggctcccagg 6600 
gggatcctga actggactga ataaaaegtg 6660 
catctgtgac ccgtgggcag cagggegtea 6720 
cagggaggee aggaggcagt aaggagegag 6780 
aggtgacatt SSSSagtaqg ggtggcctga 684 0 
aagaggaagg aggaaacgaa acccaggcng 6900 
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gctttggagg gctagcgtga ctgggctccg tgactgagct ctgtgtgcca gtggctctcc 6960 
cctctcctcg cctggcccac gccctccttg cccctggcat ggfcgcccccc aggtggctct 702 0 
attcttagct gtccgggtgt gaagtaaatc cttgggcagt gataacagcc cagagtcaac 708 0 
agggttgaga taagcagagg ctgggtcaga tccgggcgct ggcaccaggc ccagccccct 714 0 
ccctgacccc ggctncccca ccagcctgct: gcccctgggg tggnctccac aacaccctgg 72 0 0 
gaatggggaa gtggttctgg ttccctgacc cctttggccc aggcacgttg cctgtccctc 7260 
gaccgcattc ccccagggcc tgtgctgcag gcctggaagc cctgattggg gcctgccacc 7320 
agcagccaga gagctatgtt ccctggcagc tgtgatgcgc tcaggccggg ccaggacacg 7380 
tgtggcagga ggcttagagc acctgcctgg ggccttcctc tctcaggcac cagatccatt 7440 
ggttgctcct gcctagaacc acagcctagc acccctgctc cctcccgcct accacaccca 7500 
gcacagaaac tcacaggaat gattgcgctc agggaaggca gagatgtgcc tggcatcaca 7 560 
gtttatfcgtt tataaaccat gacaataaca gctgttgctc agcacaggcc tagcagagcc 7620 
cactgcaggg ggacggcagc gggcaccaga ggccttgcct ggcccaaccc aatgggaaca 768 0 
cccagactca gctgggtccc caagggagac ttggcacatt ggcatgggtg tgggacaggt 774 0 
aaagcatgca agagggagaa gagggacata aggggcatgc ggctgcgggg tgttgggacc 780 0 
caaataaata aagcaggatg acagggtccc cttcccctca ccaggaatgc ctgacagcgt 78 60 
ccagccccaa agcctgcctg tcccaaggct gtagttcagc atcaacaggg cagggagctt 7920 
ggcagggcaa gggcagagct ggagatcatg cccagtnttc caggtgccct ccctcccaat 79 80 
cagcctgggg ggcacaggac agggatggag aaggggctct ctccatggct tgggtaacat 8 040 
gccaaaggca ggtcataggg cagactcagt gggggtgggg gcctggctaa caagcaatgg 8100 
agagaacggg ggccatccag agaggttggc agaagagagc ccctgggtca agagaaaact 8160 
ttggggaaga caagacacgg gagaag 8186 



2- 3 r end o£ MUC1 gene (contains exon 7, polyA signal and flanking 
region) 

I (SEQ ID NO: 24} 

ggtacctttt gctcctcacc ctggatctct tttccttcca cccaggtttc tgcaggtaat 60 
ggtggcagca gcctctctta cacaaaccca gcagtggcag ccacttctgc caacttgtag 120 
gggcacgtcg cccgctgagc tgagtggcca gccagtgcca ttccactcca ctcaggttct 18 0 
tcagggccag agcccctgca ccctgtttgg gctggtgagc tgggagttca ggtgggctgc 24 0 
tcacacgtcc ttcagaggcc ccaccaattt ctcggacact tctcagtgtg tggaagctca 3 00 
tgtgggcccc tgaggctcat gcctgggaag tgttgtggtg ggggctccca ggaggactgg 3 60 
cccagagagc cctgagatag cggggatcct gaactggact gaataaaacg tggtctccca 420 
ctggcgccaa cttctgatct ttcatctgtg acccgtgggc agcagggcgt cagaatgtgt 480 
gtgagggggc tgggggagga gacagggagg ccaggaggca gtaaggagcg agtttgtttg 540 
agaagcagga gatgtgagga ggaggtgaca ttggggagta ggggtggcct gaggagccac 600 
ctctggctaa ccctggcagc acaagaggaa ggaggaaacg aaacccaggc gggctttgga 660 
gggctagcgt gactgggctc cgtgactgag ctctgtgtgc cagtggctct cccctctcct 720 
cgcctggccc acgccctcct tgcccctggc atggtgcccc ccaggtggct ctattctfcag 78 0 
ctgtccgggt gtgaagtaaa tccttgggca gtgataacag cccagagtca acagggttga 84 0 
gataagcaga ggctgggtca gatccgggcg ctggcaccag gcccagcccc ctccctgacc 90 0 
ccggctnccc caccagcctg ctgcccctgg ggtggnctcc acaacaccct gggaatgggg 960 
aagtggttct ggttccctga cccctttggc ccaggcacgt tgcctgtccc tcgaccgcat 102 0 
tcccccaggg cctgtgctgc aggcctggaa gccctgattg gggcctgcca ccagcagcca 1080 
gagagctatg ttccctggca gctgtgatgc gctcaggccg ggccaggaca cgtgtggcag 1140 
gaggcttaga gcacctgcct ggggccttcc tctctcaggc accagatcca ttggttgctc 1200 
ctgcctagaa ccacagccta gcacccctgc tccctcccgc ctaccacacc cagcacagaa 1260 
actcacagga atgattgcgc tcagggaagg cagagatgtg cctggcatca cagtttattg 1320 
tttataaacc atgacaataa cagctgttgc tcagcacagg cctagcagag cccactgcag 1380 
ggggacggca gcgggcacca gaggccttgc ctggcccaac ccaatgggaa cacccagact 144 0 
cagctgggtc cccaagggag acttggcaca ttggcatggg tgtgggacag gtaaagcatg 1500 
caagagggag aagagggaca taaggggcat gcggctgcgg ggtgttggga cccaaataaa 1560 
taaagcagga tgacagggtc cccttcccct caccaggaat gcctgacagc gtccagcccc 1620 
aaagcctgcc tgtcccaagg ctgttgttca gcatcaacag gggagggagc ttggcagggc 168 0 
aagggcagag ctggagatca tgcccagtgt tccaggtgcc ctccctccca atcagcctgg 174 0 
gggggacagg acagagattg agaagggggt ctctccatgg cttgggttac attccaaagg 18 0 0 
cagatcatag ggcagactca ctgggggtgg ggggc 1835 



36 



WO 03/031569 



PCT/US02/29640 



3. 5* end of MUC1 gene (contains 
t {SEQ ID NO:25) 

First ATG is shown as last three 

gaattcagaa ttttagaccc tttggccttg 
ctacagcccc tcagccaacc acagaccctt 
agggtggtca cccacccttc agggatgaga 
ccccaggaaa gatgacagca cattgccata 
ctgcttattt tagggatttt ccgggatagg 
agagaaagcc tgggagggtg gaagttaggg 
gcgggtgggg gaggtgctgg atggagttaa 
ccgaagtcct cccagaccca cttactcacg 
gatccgccgg gatggagagc caggaggcgg 
ggggccgggg ggcggacggg acgagtgcaa 
gcgtcccgtc cctcccgccc gcccgggccc 
ccgacccgga acccgggccg ctggggagtt 
cgcccggaag caaagggaat ggcaaggggg 
tccagaaggc cgctttccga cgcccgggcg 
ctgcgcggcg cgagccgtcc gcatggtgac 
ttgcttggca atgccgggag ttgaggtttg 
ccggcctgcg agttgggggg ctcccgcctc 
gatgtcccca ccccgcccag cgcctgcccc 
cgggcgccgg cgcgcgggcg agagcggtgg 
tcccgtgcct cgccgaagtg tttttaaagt 
gaactgtttt ctgaatacag gcggaactgc 
cgggacccaa gttccacgtg ctgccgcggc 
tgccgcacac acctcttggc tgtcgcgcat 
ctacgtgcct acctgtcccc aataccactc 
aatcccattc tgtcacccca cctactctct 
tttgctctgt cgcccaggct ggagtgcaat 
ctcccgggtt caagcgattc tcctgcctca 
gccaccacgc tcggctaatt tttgtagttt 
caggctggtc ttgaacccct gaccttgtga 
gattacgggc gtgacgaccg tgccacgcat 
gaacgtgtcc aactcccccg cccacgttcc 
ttccacatac ctcaggaccc cacccgctta 
cgtcccggag tgccccctcc taaagctccc 
ccctggccac ggcagtgacc cttctctccc 
gcctcactta ggcagcgctg cccttactcc 
tcggcgccca gccccgcaag gctcccggtg 
agfcggtggag agtggcaagg aaggacccta 
aagtggaaat ttcttccccc actcctcctt 
tggaaagtcc ggctggggcg gggactgtgg 
cagggagcgg ttagaagggt ggggctattc 
ctagcaccta gtccactcat tatccagccc 
acccggggag ggcggggaag tggagtggga 
tacaggacct cgacctagct ggctttgttc 
ctaaaactag agcccagggg ccccaagttc 
agggagtggt tggtgaaagg gggaggccag 
agcgattaga gcccttgtac cctacccagg 
aggtagggga gggggcgggg ttttgtcacc 
gg^cggggag tggggggacc ggtataaagc 
agcagccagc gcctgcctga atctgttctg 

9 



promoter and first ATG) 

residue s be low : 

gggtccatcc tggagaccct gaggtctaag 60 
ctctggctcc caaaaggagt tcagtcccag 12 0 
agttttcaag gggtattact caggcactaa 180 
aagttttggt tgttttctaa gccagtgcaa 24 0 
9 t 9999^agt ggaaggaatc ggcgagtaga 3 00 
atctagggga agtttggctg atttggggat 360 
gtgaaggata gggtgcctga gggaggatgc 420 
gtggcagcgg cgacactcca gtctatcaaa 480 
gggctgcccc tgaggtagcg gggaggccgg 540 
tattggcggg ggaaaaaaca acactgcacc 600 
ggatcccgct ccccaccgcc tgaagccggc 660 
gggttcacct tggaggccag agagacttgg 72 0 
aggggggagg gagaacggga gtttgcggag 780 
ttgcgcgcgc ttgctcttta agtactcaga 840 
gcgtgtccca gcaaccgaac tgaatggctg 900 
gggccgccca cctagctact cgtgttttct 960 
cccggcccgc tcctgggcgc gctgacgtca 102 0 
aagggtctcg ccgcacacaa agctcggcct 1080 
tctctcgcct gctgatctga tgcgctccaa 114 0 
gttctttcca acctgtgtct ttggggctga 1200 
ttccgtcggc ctagaggcac gctgcgactg 1260 
ctgggatagc ttcctcccct cgtgcactgc 1320 
tacgcacctc acgtgtgctt ttgccccccg 1380 
tgctccccaa aggatagttc tgtgtccgta 1440 
gcccccccct tttttgtttt gagacggagc 1500 
ggcgcgatct cggctcactg caacctccgc 1560 
gcctcctgag tagctggggt tacagcgccc 162 0 
ttagtagaga cgaggtttca ccatcttggc 1680 
tccactcgcc tcggccttcc aaagtgttgg 1740 
ctgcctctta agtacataac ggcccacaca 1800 
aacgtcctct cccacatacc tcggtgcccc 1860 
gctccatttc ctccagacgc caccaccacg 1920 
agccgtccac catgctgtgc gttcctccct 1980 
gggccctgct tccctctcgc gggctctgct 2040 
tctccgcccg gtccgagcgg cccctcagct 2100 
accactagag ggcgggagga gctcctggcc 2160 
gggttcatcg gagcccaggt ttactccctt 2220 
ggctttctcc aaggagggaa cccaggctgc 22 80 
gttcagggga gaacggggtg tggaacggga 234 0 
cgggaagtgg tggggggagg gagcccaaaa 24 0 0 
tcttatttct cggccgctct gcttcagtgg 2460 
gacctagggg tgggcttccc gaccttgctg 2520 
cccatcccca cgttagttgt tgccctgagg 2580 
cagactgccc ctcccccctc ccccggagcc 2640 
ctggagaaca aacgggtagt cagggggttg 2700 
aatggttggg gaggaggagg aagaggtagg 2760 
tgtcacctgc tcgctgtgcc tagggcgggc 282 0 
ggtaggcgcc tgtgcccgct ccacctctca 2880 
ccccctcccc acccatttca ccaccaccat 2 94 0 

2941 



4 • Differentially spliced forms of MTTC1 
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a. cDNA sequence of "MDTC1 seq rt : (SEQ ID NO: 26) 

Met Thr Pro Gly Thr Gin Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 
15 10 15 

Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly 

20 25 30 

Gly Glu Lys Glu Thr Ser Ala Thr Gin Arg Ser Ser Val Pro Ser Ser 
35 40 45 

Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His 
50 55 60 

Ser Pro Gly Ser Gly Ser Ser Thr Thr Gin Gly Gin Asp Val Thr Leu 
65 70 75 80 

Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gin 

85 90 95 

Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr 

100 105 110 

Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro 
115 120 125 

Gly Ser Thr Ala Pro Pro Ala Gin Gly Val Thr Ser Ala Pro Glu Thr 
130 135 140 

Arg Pro Pro Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser 
145 150 155 160 

Ala Pro Asp Asn Arg Pro Ala Leu Ala Ser Thr Ala Pro Pro Val His 

165 170 175 

Asn Val Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu 

180 185 190 

Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys 
195 200 205 

Ser Thr Pro Phe Ser lie Pro Ser His His Ser Asp Thr Pro Thr Thr 
210 215 220 

Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser 
225 230 235 240 

Thr Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gin Leu 

245 250 255 

Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe His lie Ser Asn Leu 

260 265 270 

Gin Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gin Glu 
275 280 285 

Leu Gin Arg Asp lie Ser Glu Met Val Ser lie Gly Leu Ser Phe Pro 
290 295 300 

Met Leu Pro 
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: (SEQ ID NO: 27) 

gagctcctgg ccagtggtgg agagtggcaa 
gtttactccc ttaagtggaa atttcttccc 
ggaaccccag gctgctggaa agtccggctg 
tgcgtgtgga acgggacagg gagcggttag 
9ra999 a OT9f agcccaaaac tagcacctag 
ggccgcctct gcttcagtgg acccggggag 
tgggcttccc gaccttgctg tacaggacct 
gttagttgtt gccctgaggc taaaactaga 
tcccccctcc cccggagcca gggagtggtt 
acgggtagtc aggggttgca gcattagagc 
gagagaagag tagagtaggg aggggggttt 
ggcgggcggg ggggagtggg gggaccggta 
ctctcaagca gccagcgcct gcctgaatct 
caccatgaca ccgggcaccc agtctccttt 
aggtgagggg cacgaggtgg ggagtgggct 
tgtgggttfct gctccctggc agatggcacc 
ctgccctgtc fcgtgccagaa ggagggagag 
aaccctgaga gtgggtacca ggggcaagca 
gagtagggag agggaaggct taagagggga 
tgggcagaga gaaggaggct gctgcaggaa 
tgcccactcc ccagtcctcc tggtattatt 
tgctcttatt tttccttcat aaagacccaa 
cagcccctgg gcccgcaaca gttgtfcacag 
gagaaaagga gacttcggct acccagagaa 
ctgtgagtat gaccagcagc gtactctcca 
ctcagggaca ggatgtcact ctggccccgg 
cctggggaca ggatgtcacc tcggtcccag 
cgccagccca cgatgtcacc tcagccccgg 
ccccagccca gggtgtcacc tcggccccgg 
ccccagccca tggtgtcacc tcggcgccgg 
ctccagtcca caatgtcacc tcggcctcag 
tgcacaacgg cacctctgcc agggctacca 
caattcccag ccaccactct gatactccta 
atgccagtag cactcaccat agcacggtac 
ctccccagtt gtctactggg gtctctttct 
agtttaattc ctctctggaa gatcccagca 
tttctgaaat ggtgagtatc ggcctttcct 
actgtccaca ccctttgcat caagcctgag 
tttataaaca agggggtttt ctgggcctct 
gtggacccag tgtggtggfct ggaggggtgg 
gtgcacttaa ggttggggga agagtgctga 
catttccctg tgaccaggcc aggatctgtg 
ggtaccatca atgtccacga cgtggagaca 
tcfccgatata acctgacgat ctcaagacgt 
agcaccatgc cggggcccct ctccttccag 
ggcagcggga ggggcgcctc ctctgggaga 
agtgatgtgc catttccttt ctctgaccag 
gcgctgctgg tgctggtctg tgttctggtt 
gtgagtgcag tccctggccc tgatcagagc 
ataacctcct atcfcccccag gctgtctgtc 
acatctttcc agcccgggat acctaccatc 
atgggcgcta tgtgccccta gcagtaccga 
ccccacaggc aggggaagca gagggtttgg 
aaaacccaaa gagctfcggaa gaggtgagaa 
aaggatgagg ggcagaggtc agaggagttt 



ggaaggaccc tagggttcat cggagcccag 60 
ccactcccct ccttggcttt ctccaaggag 12 0 
gggcggggac tgtgggtttc agggtagaac 180 
aagggtgggg ctattccggg aagtggtggt 240 
tccactcatt atccagccct cttatttctc 300 
ggcggggaag tggagtggga gacctagggg 3 60 
cgacctagct ggctttgttc cccatcccca 42 0 
gcccaggggc cccaagttcc agactgcccc 480 
ggtgaaaggg ggaggccagc tggagaagaa 54 0 
ccttgtagcc ctagcccagg aatggttgga 600 
gtcacctgtc acctgctcgg ctgtgcctag 660 
taaagcggta ggcgcctgtg cccgctccac 72 0 
gttctgcccc ctccccaccc atttcaccac 780 
cttcctgctg ctgctcctca cagtgcttac 84 0 
gccctgctta ggtggtcttc gtggtctttc 90 0 
agaagttaag gtaagaattg cagacagagg 960 
gctaaggaca ggctgagaag agttgccccc 1020 
aatgtcctgt agagaagtct agggggaaga 1080 
agaaatgcag gggccatgag ccaaggccta 1140 
9Sf a 99 c ^ cc aacccagggg ttactgaggc 1200 
tctctggtgg ccaggcttat attttcttct 1260 
ccctatgact ttaacttctt acagctacca 132 0 
gttctggtca tgcaagctct accccaggtg 1380 
gttcagtgcc cagctctact gagaagaatg 1440 
gccacagccc cggttcaggc tcctccacca 1500 
ccacggaacc agcttcaggt tcagctgcca 1560 
tcaccaggcc agccctgggc tccaccaccc 162 0 
acaacaagcc agccccgggc tccaccgccc 1680 
agaccaggcc gcccccgggc tccaccgccc 174 0 
acaacaggcc cgccttggcg tccaccgccc 1800 
gctctgcatc aggctcagct tctactctgg 1860 
caaccccagc cagcaagagc actccattct 192 0 
ccacccttgc cagccatagc accaagactg 1980 
ctcctctcac ctcctccaat cacagcactt 2040 
ttttcctgtc ttttcacatt tcaaacctcc 2100 
ccgactacta ccaagagctg cagagagaca 2160 
tccccatgct cccctgaagc agccatcaga 2220 
tcctttccct ctcaccccag tttttgcaga 2280 
ccaatattaa gttcaggtac agttctgggt 2340 
gtggtggtca tgagccgtag ggagggactg 2400 
gccagagctg ggacccgtgg ctgaagtgcc 2460 
gtggtacaat tgactctggc cttccgagaa 2520 
cagttcaatc agtataaaac ggaagcagcc 2580 
cagcggtgag gctacttccc tgctgcagcc 264 0 
tgtctgggtc cccgctcttt ccttagtgct 2700 
ctgccctgac cactgctttt ccttttagtg 2760 
tctggggcfcg gggtgccagg ctggggcatc 2820 
gcgctggcca ttgtctatct cattgccttg 2 88 0 
cccccggtag aaggcactcc atggcctgcc 2 940 
agtgccgccg aaagaactac gggcagctgg 3 00 0 
ctatgagcga gtaccccacc fcaccacaccc 3 06 0 
tcgtagcccc tatgagaagg tgagattggg 3120 
ctgggcaagg attctgaagg gggtacttgg 3180 
gtggcgtgaa gtgagcaggg gagggctggc 324 0 
tgggggacag gcctgggagg agactatgga 3 300 
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agaaaggggc ccctcaaaag ggagtgcccc actgccagaa ttc 3343 



to* DNA sequence of MUC1Y: (SEQ ID NOs28) 



Met Thr Pro Gly Thr Gin Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 
15 10 15 

Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly 

20 25 30 

Gly Glu Lys Glu Thr Ser Ala Thr Gin Arg Ser Ser Val Pro Ser Ser 
35 40 45 

Thr Glu Lys Asn Ala Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp 
50 55 60 

Tyr Tyr Gin Glu Leu Gin Arg Asp lie Ser Glu Met Phe Leu Gin lie 
65 70 75 80 

Tyr Lys Gin Gly Gly Phe Leu Gly Leu Ser Asn lie Lys Phe Arg Pro 

85 90 95 

Gly Ser Val Val Val Gin Leu Thr Leu Ala Phe Arg Glu Gly Thr lie 

100 105 110 

Asn Val His Asp Val Glu Thr Gin Phe Asn Gin Tyr Lys Thr Glu Ala 
115 120 125 

Ala Ser Arg Tyr Asn Leu Thr lie Ser Asp Val Ser Val Ser Asp Val 
130 135 140 

Pro Phe Pro Phe Ser Ala Gin Ser Gly Ala Gly Val Pro Gly Trp Gly 
145 150 155 160 

lie Ala Leu Leu Val Leu Val Cys Val Leu Val Ala Leu Ala He Val 

165 170 175 

Tyr Leu He Ala Leu Ala Val Cys Gin Cys Arg Arg Lys Asn Tyr Gly 

180 185 190 

Gin Leu Asp lie Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu 
195 200 205 

Tyr Pro Thr Tyr His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr 
210 215 220 

Asp Arg Ser Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser 
225 230 235 240 

Leu Ser Tyr Thr Asn Pro Ala Val Ala Ala Thr Ser Ala Asn Leu 

245 250 255 

: (SEQ ID NO? 29) 

atgacaccgg gcacccagtc tcctttcttc cfcgctgctgc tcctcacagt gcttacagtt 60 
gttacaggtt ctggtcatgc aagctctacc ccaggtggag aaaaggagac ttcggctacc 12 0 
cagagaagtt cagtgcccag ctctactgag aagaatgctt ttaattcctc tctggaagat 180 
cccagcaccg actactacca agagctgcag agagacattt ctgaaatgtt tttgcagatt 24 0 
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tataaacaag ggggttttct gggcctctcc aatattaagt tcaggccagg atctgtggtg 3 00 

gtacaattga ctctggcctt ccgagaaggt accatcaatg tccacgacgt ggagacacag 3 60 

ttcaatcagt ataaaacgga agcagcctct cgatataacc tgacgatctc agacgtcagc 42 0 

gtgagtgatg tgccatttcc tttctctgcc cagtctgggg ctggggtgcc aggctggggc 4 80 

atcgcgctgc tggtgctggt ctgtgttctg gttgcgctgg ccattgtcta tctcattgcc 540 

ttggctgtct gtcagtgccg ccgaaagaac tacgggcagc tggacatctt tccagcccgg 600 

gatacctacc atcctatgag cgagtacccc acctaccaca cccatgggcg ctatgtgccc 660 

cctagcagta ccgatcgtag cccctatgag aaggtttctg caggtaatgg tggcagcagc 72 0 

ctctcttaca caaacccagc agtggcagcc acttctgcca acttgtag 768 

C- MUC-1 AA: : (SEQ ID NO;30> 

Met Thr Pro Gly Thr Gin Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 
15 10 15 

Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly 

20 25 30 

Gly Glu Lys Glu Thr Ser Ala Thr Gin Arg Ser Ser Val Pro Ser Ser 
35 40 45 

Thr Glu Lys Asn Ala Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser 
50 55 60 

Phe His lie Ser Asn Leu Gin Phe Asn Ser Ser Leu Glu Asp Pro Ser 
65 70 75 80 

Thr Asp Tyr Tyr Gin Glu Leu Gin Arg Asp lie Ser Glu Met Ala Val 

85 90 95 

Cys Gin Cys Arg Arg Lys Asn Tyr Gly Leu Leu Asp lie Phe Pro Ala 

100 105 110 

Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr His 
115 120 125 

Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu Lys 
130 135 140 

Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro Ala 
145 150 155 160 

Val Ala Ala Thr Ser Ala Asn Leu 

165 



: (SEQ ID HO : 3 1 ) 

ctccccaccc atttcaccac caccatgaca ccgggcaccc agtctccttt cttcctgctg 60 
ctgctcctca cagtgcttac agttgttaca ggttctggtc atgcaagctc taccccaggt 120 
ggagaaaagg agacttcggc tacccagaga agttcagtgc ccagctctac tgagaagaat 180 
gctttgtcta ctggggtctc tttctttttc ctgtcttttc acatttcaaa cctccagttt 240 
aattcctctc tggaagatcc cagcaccgac tactaccaag agctgcagag agacatttct 3 00 
gaaatggctg tctgtcagtg ccgccgaaag aactacgggc tgcfcggacafc ctttccagcc 3 60 
cgggatacct accatcctat gagcgagtac cccacctacc acacccatgg gcgctatgtg 42 0 
ccccctagca gtaccgatcg tagcccctat gagaaggttt ctgcaggtaa tggtggcagc 4 80 
agcctctctt acacaaaccc agcagtggca gccacttctg ccaacttgta ggggcacgtc 54 0 
gcc 543 
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d. cDNA of a variant of ^MUCIY" : (SBQ ID NO: 32) 

Met Thr Pro Gly Thr Gin Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 
15 10 15 

Val Leu Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu Lys 

20 25 30 

Glu Thr Ser Ala Thr Gin Arg Ser Ser Val Pro Ser Ser Thr Glu Lys 
35 40 45 

Asn Ala Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gin 
50 55 60 

Glu Leu Gin Arg Asp lie Ser Glu Met Phe Leu Gin lie Tyr Lys Gin 
65 70 75 80 

Gly Gly Phe Leu Gly Leu Ser Asn lie Lys Phe Arg Pro Gly Ser Val 

85 90 95 

Val Val Gin Leu Thr Leu Ala Phe Arg Glu Gly Thr lie Asn Val His 

100 105 110 

Asp Met Glu Thr Gin Phe Asn Gin Tyr Lys Thr Glu Ala Ala Ser Arg 
115 120 125 

Tyr Asn Leu Thr lie Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro 
130 135 140 

Phe Ser Ala Gin Ser Gly Ala Gly Val Pro Gly Trp Gly lie Ala Leu 
145 150 155 160 

Leu Val Leu Val Cys Val Leu Val Ala Leu Ala lie Val Tyr Leu lie 

165 170 175 

Ala Leu Ala Val Cys Gin Cys Arg Arg Lys Asn Tyr Gly Gin Leu Asp 

180 185 190 

lie Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr 
195 200 205 

Tyr His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser 
210 215 220 

Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr 
22S 230 235 140 

Thr Asn Pro Ala Val Ala Ala Thr Ser Ala Asn Leu 

245 250 

: (SEQ ID NO:33) 

atgacaccgg gcacccagtc tcctttcttc ctgctgctgc tcctcacagt gcttacaggt 60 
tctggtcatg caagctctac cccaggtgga gaaaaggaga cttcggctac ccagagaagt 120 
tcagtgccca gctctactga gaagaatgct tttaattcct ctctggaaga tcccagcacc 180 
gactactacc aagagctgca gagagacatt tctgaaatgt ttttgcagat ttataaacaa 240 
gggggttttc tgggcctctc caatattaag ttcaggccag gatctgtggt ggtacaattg 3 00 
actctggcct tccgagaagg taccatcaat gtccacgaca tggagacaca gttcaatcag 3 60 
tataaaacgg aagcagcctc tcgatataac ctgacgatct cagacgtcag cgtgagtgat 4 20 
gtgccatttc ctttctctgc ccagtctggg gctggggtgc caggctgggg catcgcgctg 4 80 
ctggtgctgg tctgtgttct ggttgcgctg gccattgtct atctcattgc cttggctgtc 540 
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tgtcagtgcc gccgaaagaa ctacgggcag 
catcctatga gcgagtaccc cacctaccac 
accgatcgta gcccctatga gaaggtttct 
acaaacccag cagtggcagc cacttctgcc 



ctggacatct ttccagcccg ggatacctac 600 
acccatgggc gctatgtgcc ccctagcagt 660 
gcaggtaatg gtggcagcag cctctcttac 72 0 
aacttgtag 759 



Reference: no published reference, only the database information 



e. Mu"ClX or MtXClE partial cDNA sequences ; (SEQ ID NO: 34) 



Met Thr Pro Gly Thr Gin Ser Pro 
1 5 

Val Leu Thr val Val Thr Gly Ser 

20 

Gly Glu Lys Glu Thr Ser Ala Thr 
35 40 

Thr Glu Lys Asn Ala Leu Ser Thr 

50 55 



Phe Phe Leu Leu Leu Leu Leu Thr 
10 15 

Gly His Ala Ser Ser Thr Pro Gly 
25 30 

Gin Arg Ser Ser Val Pro Ser Ser 

45 

Gly Val Ser Phe Phe Phe Leu Ser 

60 



Phe His lie Ser Asn Leu Gin Phe Asn Ser Ser Leu Glu 
65 70 75 



f , S81781, CDNA: (SEQ ID NOt35) 

Met Thr Pro Gly Thr Gin Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 
15 10 15 



Val Leu Thr Ala Thr Thr Ala Pro 

20 

Ser Gly His Ala Ser Ser Thr Pro 
35 40 

Thr Gin Arg Ser Ser Val Pro Ser 
50 55 

Met Thr Ser Ser Val Leu Ser Ser 
65 70 



Lys Pro Ala Thr Val Val Thr Gly 
25 30 

Gly Gly Glu Lys Glu Thr Ser Ala 

45 

Ser Thr Glu Lys Asn Ala Val Ser 

60 

His Ser Pro Gly Ser Gly Ser Ser 
75 80 



Thr Thr Gin Gly 



Ser Gly Ser Ala 

100 



Gin Asp Val Thr 
85 

Ala Thr Trp Gly 



Leu Ala Pro Ala 
90 

Gin Asp Val Thr 
105 



Thr Glu Pro Ala 
95 

Ser 



: (SEQ ID NO: 36) 

accaccacca tgacaccggg cacccagtct 
cttacagcta ccacagcccc taaacccgca 
tctaccccag gtggagaaaa ggagacttcg 
actgagaaga atgctgtgag tatgaccagc 
ggctcctcca ccactcaggg acaggatgtc 
ggttcagctg ccacctgggg acaggatgtc 



cctttcttcc tgctgctgct cctcacagtg 60 
acagttgtta caggttctgg tcatgcaagc 120 
gctacccaga gaagttcagt gcccagctct 180 
agcgtactct ccagccacag ccccggttca 240 
actctggccc cggccacgga accagcttca 300 
acctcg 33 6 
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Reference: Int. J- Cancer 66 (l) , 55-59 (1936) 

g. M3273 8, partial cDNA of MTTC1 splice variant A: : (SEQ ID NO: 37) 

Met Thr Pro Gly Thr Gin Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 
15 io 15 

Val Leu Thr Ala Thr Thr Ala Pro Lys Pro Ala Thr Val Val Thr Gly 

20 25 30 

Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu Lys Glu Thr Ser Ala 
35 40 45 

Thr Gin Arg Ser Ser Val Pro Ser Ser Thr Glu Lys Asn Ala Val Ser 
50 55 60 

Met Thr Ser Ser Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser Ser 
65 70 75 80 

Thr Thr Gin Gly Gin Asp Val Thr Leu Ala Pro Ala Thr Glu Pro Ala 

85 90 95 

Ser Gly Ser Ala Ala Thr Trp Gly Gin Asp Val Thr Ser Val Pro Val 

100 105 110 

Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro Ala His Asp Val Thr 
115 120 125 

Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala 
130 135 140 

His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala 
145 150 155 

: (SEQ ID NO:38) 

gcgcctgcct gaatctgttc tgccccctcc ccacccattt caccaccacc atgacaccgg 60 
gcacccagtc tcctttcttc ctgctgctgc tcctcacagt gcttacagct accacagccc 120 
ctaaacccgc aacagttgtt acaggttctg gtcatgcaag ctctacccca ggtggagaaa 18 0 
aggagacttc ggctacccag agaagttcag tgcccagctc tactgagaag aatgctgtga 240 
gtatgaccag cagcgtactc tccagccaca gccccggttc aggctcctcc accactcagg 3 00 
gacaggatgt cactctggcc ccggccacgg aaccagcttc aggttcagct gccacctggg 3 60 
gacaggatgt cacctcggtc ccagtcacca ggccagccct gggctccacc accccgccag 4 20 
cccacgatgt cacctcagcc ccggacaaca agccagcccc gggctccacc gcccccccag 4 80 
cccacggtgt cacctcggcc ccggacacca ggccggcc 518 

Reference: J, Biol. Chem. 265, 5573-5578 (1990) 

h. Z17324, partial cDNA of MTJCl splice variant Cs (SEQ ID NO:39) 

Met Thr Pro Gly Thr Gin Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 
15 10 15 

Val Leu Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu Lys 

20 25 30 

Glu Thr Ser Ala Thr Gin Arg Ser Ser Val Pro 
35 40 
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: {SEQ. ID NO:40} 

ccgctccacc tctcaagcag ccagcgcctg cctgaatctg ttctgccccc tccccaccca 60 

tttcaccacc accatgacac cgggcaccca gtctcctttc ttcctgctgc tgctcctcac 120 

agtgcttaca ggttctggtc atgcaagctc taccccaggt ggagaaaagg agacttcggc 180 
tacccagaga agttcagtgc ccag 204 

Reference: no literature reference, a direct submission to the 
database 

i. Z17325, partial cDNA of MUC1 splice variant D 
: (SEQ ID NO: 41) 

Met Thr Pro Gly Thr Gin Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr 
15 10 15 

Val Leu Thr Gly Gly Glu Lys Glu Thr Ser Ala Thr Gin Arg Ser Ser 

20 25 30 

Val Pro 



; (SEQ ID NO: 42) 

ccgctccacc tctcaagcag ccagcgcctg cctgaatctg ttctgccccc tccccaccca 60 
tttcaccacc accatgacac cgggcaccca gtctcctttc ttcctgctgc tgctcctcac 12 0 
agtgcttaca ggtggagaaa aggagacttc ggctacccag agaagttcag tgcccag 177 

5. CTL epitopes of MtXCl? J (SEQ ID NO: 43) 

Ser Thr Ala Pro Pro Val His Asn Val 
1 5 



Reference: Blood 93:4309-4317, 1999 
: (SEQ ID NOs44) 

Leu Leu Leu Leu Thr Val Leu Thr Val 

1 5 



Reference: Blood 93:4309-4317, 1999 
: (SEQ ID NO: 45) 

Ser Thr Ala Pro Pro Ala His Gly Val 
1 5 



Reference J Immunology 155:4766-4774, 1995; J Immunology 159:5211- 
5218, 1997 

: (SEQ ID NO:46) 

Ala Pro Asp Thr Arg Pro Ala 
1 5 



Reference J Immunology 159:5211-5218, 1997 
6, CD4 T helper epitopes of MUC1 
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5 (SEQ ID N0s47) 

Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr 
15 10 

for HLA DR3 Reference: Cancer Research 58: 5 066^507 0, 1998 
IV, Sequences for DNA vaccine vectors: 

1- HCMV promoter /enhancer; K014 84 Mark Stinski U Iowa 

4 90 bp of promoter sequence, to transcriptional start 
: (SEQ ID N0i48 

ggcgaccgcc cagcgacccc cgcccgttga cgtcaatagt gacgtatgtt cccatagtaa 60 

cgccaatagg gactttccat tgacgtcaat gggtggagta tttacggtaa actgcccact 120 

tggcagtaca tcaagtgtat catatgccaa gtccgccccc tattgacgtc aatgacggta 18 0 

aatggcccgc ctagcattat gcccagtaca tgaccttacg ggagtttcct acttggcagt 24 0 

acatctacgt attagtcatc gctattacca tggtgatgcg gttttggcag tacaccaatg 3 00 

ggcgtggata gcggtttgac tcacggggat ttccaagtct ccaccccatt gacgtcaatg 360 

ggagtttgtt ttggcaccaa aatcaacggg actttccaaa atgtcgtaat aaccccgccc 42 0 

cgttgacgca aatgggcggt aggcgtgtac ggtgggaggt ctatatagca gagctcgttt 4 80 

agtgaaccgt cagatcgcct ggagacgcca tccacgctgt tttgacctcc atagaagaca 54 0 

ccgggaccga tccagcctcc gcggccggga acggtgcatt ggaacgcgga ttccccgtgc 60 0 
caagagtgac gtaagt 616 

Reference: J. Virol. 49, 190-199 (1984); Proc. Natl. Acad. Sci, ULS.A* 
81, 659-663 (1984) 

2. HCMV promoter/ enhancer? K03104 
: {SEQ ID NO :49) 

737hp of promoter sequence, to +193bp; includes exon 1 and part of 
intron A 

aatcaatatt ggccattagc catattattc attggttata tagcataaat caatattggc 60 

tattggccat tgcatacgtt gtatccatat cataatatgt acatttatat tggctcatgt 12 0 

ccaacattac cgccatgttg acattgatta ttgactagtt attaatagta atcaattacg 18 0 

gggtcattag ttcatagccc atatatggag ttccgcgtta cataacttac ggtaaatggc 240 

ccgcctggct gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc 3 00 

atagtaacgc caatagggac tttccattga cgtcaatggg tggagtattt acggtaaact 3 60 

gcccacttgg cagtacatca agtgtatcat atgccaagta cgccccctat tgacgtcaat 420 

gacggtaaat ggcccgcctg gcattatgcc cagtacatga ccttatggga ctttcctact 4 80 

tggcagtaca tctacgtatt agtcatcgct attaccatgg tgatgcggtt ttggcagtac 540 

atcaatgggc gtggatagcg gtttgactca cggggatttc caagtctcca ccccattgac 600 

gtcaatggga gtttgttttg gcaccaaaat caacgggact ttccaaaatg tcgtaacaac 660 

tccgccccat tgacgcaaat gggcggtagg cgtgtacggt gggaggtcta tataagcaga 720 

gctcgtttag tgaaccgtca gatcgcctgg agacgccatc cacgctgttt tgacctccat 780 

agaagacacc gggaccgatc cagcctccgc ggccgggaac ggtgcattgg aacgcggatt 840 

ccccgtgcca agagtgacgt aagtaccgcc tatagagtct ataggcccac ccccttggct 900 
tcttatgcat gctatactgt ttttggcttg 930 

Reference: cell 41:521-530, 1985 

3. HCMV promoter, exon 1, intron A and part of exon 2; M60321 
: (SEQ ID NO: 50) 

ctgcagtgaa taataaaatg tgtgtttgtc cgaaatacgc gttttgagat ttctgtcgcc 60 

gactaaattc atgtcgcgcg atagtggtgt ttatcgccga tagagatggc gatattggaa 12 0 

aaatcgatat ttgaaaatat ggcatattga aaatgtcgcc gatgtgagtt tctgtgtaac 180 
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tgatatcgcc atttttccaa aagtgatttt tgggcatacg cgatatctgg cgatacggct 240 
tatatcgttt acgggggatg gcgatagacg actttggcga cttgggcgat tctgtgtgtc 3 00 
gcaaatatcg cagtttcgat ataggtgaca gacgatatga ggctatatcg ccgatagagg 3 60 
cgacatcaag ctggcacatg gccaatgcat atcgatctat acattgaatc aatattggca 420 
attagccata ttagtcattg gttatatagc ataaatcaat attggctatt ggccattgca 480 
tacgttgtat ctatatcata atatgtacat ttatattggc tcatgtccaa tatgaccgcc 540 
atgttgacat tgattattga ctagttatta atagtaatca attacggggt cattagttca 600 
tagcccatat atggagttcc gcgttacata acttacggta aatggcccgc ctcgtgaccg 660 
cccaacgacc cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata 72 0 
gggactttcc attgacgtca atgggtggag tatttacggt aaactgccca cttggcagta 780 
catcaagtgt atcatatgcc aagtccggcc ccctattgac gtcaatgacg gtaaatggcc 84 0 
cgcctggcat fcatgcccagt acatgacctt acgggacttt cctacttggc agtacatcta 90 0 
cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacacca atgggcgtgg 96 0 
atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca atgggagttt 1020 
gttttggcac caaaatcaac gggactttcc aaaatgtcgt aataaccccg ccccgttgac 10 80 
gcaaatgggc ggtaggcgtg tacggtggga ggtctatata agcagagctc gtttagtgaa 1140 
ccgtcagatc gcctggagac gccatccacg ctgttttgac ctccatagaa gacaccggga 1200 
ccgatccagc ctccgcggcc gggaacggtg cattggaacg cggattcccc gtgccaagag 1260 
tgacgtaagt accgcctata gactctatag gcacacccct ttggctctta tgcatgctat 132 0 
actgtttttg gcttggggcc tatacacccc cgctccttat gctataggtg atggtatagc 13 8 0 
ttagcctata ggtgtgggtt attgaccatt attgaccact cccctattgg tgacgatact 144 0 
ttccattact aatccataac atggctcttt gccacaacta tctctattgg ctatatgcca 15 0 0 
atactctgtc cttcagagac tgacacggac tctgtatttt tacaggatgg ggtcccattt 1560 
attatttaca aattcacata tacaacaacg ccgtcccccg tgcccgcagt ttttattaaa 1620 
catagcgtgg gatctccacg cgaatctcgg gtacgtgttc cggacatggg ctcttctccg 16 80 
gtagcggcgg agcttccaca tccgagccct ggtcccatgc ctccagcggc tcatggtcgc 1740 
tcggcagctc cttgctccta acagtggagg ccagacttag gcacagcaca atgcccacca 1800 
ccaccagtgt gccgcacaag gccgtggcgg tagggtatgt gtctgaaaat gagctcggag 1860 
attgggctcg caccgtgacg cagatggaag acttaaggca gcggcagaag aagatgcagg 192 0 
cagctgagtt gttgtattct gataagagtc agaggtaact cccgttgcgg tgctgttaac 1980 
39 t 99 a 9f99 c agtgtagtct gagcagtact cgttgctgcc gcgcgcgcca ccagacataa 2040 
tagctgacag actaacagac tgttcctttc catgggtctt ttctgcagtc accgtccttg 2100 
acacgatgga gtcctctgcc aagagaaaga tggaccctga taatcctgac gagggccctt 2160 
cctccaaggt gccacggtac gtgtcggggt ttgtgccccc cctttttttt ataaaattgt 2220 
attaatgtta tatacatatc tcctgtatgt gacccatgtg cttatgactc tatttctcat 22 80 
gtgtfctaggc ccgagacacc cgtgaccaag gccacgacgt tcctgcagac tatgttgagg 2340 
aaggaggtta acagtcagct g 2361 

Reference: KFucleic Acids Res. 19, 3979-3986 (1991) 

4. HCMV promoter /enhancer with upstream NF1 binding sites; includes 
114 Obp of upstream promoter with 748bp of exon 1 and intron A; 
X03922 

: (SEQ ID HO; 51 

ctgcagtgaa taataaaatg tgtgtttgtc cgaaatacgc gtttgagatt tctgtcccga 60 

ctaaattcat gtcgcgcgat agtggtgttt atcgccgata gagatggcga tattggaaaa 120 

atcgatattt gaaaatatgg catattgaaa atgtcgccga tgtgagtttc tgtgtaactg 180 

atatcgccat ttttccaaaa gttgattttt gggcatacgc gatatctggc gatacgctta 240 

tatcgtttac gggggatggc gatagacgcc tttggtgact tgggcgattc tgtgtgtcgc 300 

aaatatcgca gtttcgatat aggtgacaga cgatatgagg ctatatcgcc gatagaggcg 360 

acatcaagct ggcacatggc caatgcatat cgatctatac attgaatcaa tattggccat 420 

tagccatatt attcattggt tatatagcat aaatcaatat tggctattgg ccattgcata 480 

cgttgtatcc atatcataat atgtacattt atattggctc atgtccaaca ttaccgccat 54 0 

gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 600 

gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 660 

ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 720 

ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 7 80 

atcaagtgta tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg 840 

cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg 900 

47 



WO 03/031569 



PCTYUS02/29640 



tattagtcat cgctattacc atggtgatgc ggttttggca gtacatcaat gggcgtggat 960 
agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 1020 
tttggcacca aaatcaacgg gactttccaa aatgtcgtaa caactccgcc ccattgacgc 1080 
aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 1140 
gtcagatcgc ctggagacgc catccacgct gttttgacct ccatagaaga caccgggacc 12 00 
gatccagcct ccgcggccgg gaacggtgca ttggaacgcg gattccccgt gccaagagtg 12 60 
acgtaagtac cgcctataga gtctataggc ccaccccctt ggcttcttat gcatgctata 132 0 
ctgtttttgg cttggggtct atacaccccc gcttcctcat gttataggtg atggtatagc 13 80 
ttagcctata ggtgtgggtt attgaccatt attgaccact cccctattgg tgacgatact 1440 
ttccattact aatccataac atggctcttt gcacaactct ctttattggc tatatgccaa 15 00 
tacactgtcc ttcagagact gacacggact ctgtattttt acaggatggg gtctcattta 1560 
ttatttacaa attcacatat acaacaccac cgtccccagt gcccgcagtt tttattaaac 1620 
ataacgtggg atctccagcg aatctcgggt acgtgttccg gacatggggc tcttctccgg 1680 
tagcggcgga gcttctacat ccagccctgc tcccatcctc ccactcatgg tcctcggcag 1740 
ctccttgctc ctaacagtgg aggccagact taggcacagc acgatgccca ccaccaccag 1800 
tgtgcccaca aggccgtggc ggtagggtat gtgtctgaaa atgagctc 1848 



Reference: EMBO J- 5 (6), 1367-1371 (1986) 

5. Various strains of HMCV IE promoter/ enhancer; these are different 
from each other at a few residues compared to the two sequences 
listed above in 1 and 2j M64940 -M64944 

M64940 
: <SEQ ID NO: $2) 

ggcacatggc caatgcatat cgatatatac attgaatcaa tattggctat tagccatatt 60 
agtcattggt tatatagcat aaatcaatat tggctaatgg ccattgcata cattgcagct 12 0 
atagcataat atgtacattt atattggctc atgtccaata tgaccgccat gttgacattg 180 
attattgact agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat 240 
ggagttcccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 300 
cccgcccatt gacgtcaata atgacgtgag ttcccatagt aacgccaata gggactttcc 360 
attgacgtca atgggaggag tatttacggt aaactgccca cttggcagta catcaagtgt 420 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 480 
atgcccagta catgacctta cgggactttc ctacttggca gtacatctac gtattagtca 540 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 600 
acfccacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 660 
aaattcaacg ggactttcca aaatgtcgta ataactccgc cccattgacg caaatgggcg 720 
gtaggcgtgt acgatgggtg gtctatataa gcagagctcg tttagtgaac cgtcagatcg 7 80 
cctggagacg ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc 840 
tccgcggccg ggaacggtgc attggaacgc ggattc 8 76 



M64941 

: (SEQ ID NO: 53} 

ggcacatggc caatgcatat cgatatatac attgaatcaa tattggccat tagccatatt 60 
agtcattggt tatatagcat aaatcaatat tggctaatgg ccattgcata cgttgcatct 12 0 
atatcataat gtgtacattt atattggctc atgtccaata tgaccgccat gttgacattg 180 
attattgact agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat 240 
ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc 3 00 
ccgcccattg acgtcaataa tgacgtgggt tcccatagta acgccaatag ggactttcca 3 60 
ttgacgtcaa tgggaggagt atttacggta aactgcccac ttggcagtac atcaagtgta 42 0 
tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg cctggcatta 4 80 
tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg tattagtcat 54 0 
cgctattacc atggtgatgc ggttttggca gtacatcaat gggcgtggat agcggtttga 600 
ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt tttggcacca 660 
aattcaacgg gactttccaa aatgtcgtaa taactccgcc ccattgacgc aaatgggcgg 720 
taggcgtgta ctatgggagg tctatataag cagagctcgt ttagtgaacc gtcagatcgc 780 
ctggagacgc catccacgct gttttgacct ccatagaaga caccgggacc gatccagcct 840 
ccgcggccgg gaacggtgca ttggaacgcg gattc 875 
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M64942 

: (SEQ ID NO:54) 

ggcacatggc caatgcatat cgatatatac attgaatcaa tattggctat tagccatatt 60 
agtcattggt tatatagcat aaatcaatat tggctaatgg ccattgcata cattgcagct 12 0 
atagcataat atgtacattt atattggctc atgtccaata tgaccgccat gttgacattg 18 0 
attattgact agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat 24 0 
ggagttcccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 3 00 
cccgcccatt gacgtcaata ttgacgtgag ttcccatagt aacgccaata gggactttcc 360 
attgacgtca atgggtggag tatttacggt aaactgccca cthggcagta catcaagtgt 420 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 480 
atgcccagta catgacctta cgggactttc ctacttggca gtacatctac gtattagtca 540 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg GOO 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 660 
aaattcaacg ggactttcca aaatgtcgta ataactccgc cccattgacg caaatgggcg 720 
gtaggcgtgt acgattggga ggtctatata agcagagctc gtttagtgaa ccgtcagatc 780 
gcctggagac gccatccacg ctgttttgac ctccatagaa gacaccggga ccgatccagc 840 
ctccgcggcc gggaacggtg cattggaacg cggattc 877 



M64943 

: (SEQ ID NO; 55) 

ggcacatggc caatgcatat cgatctatac attgaatcaa tattggccat tagccatatt 60 
agtcattggt tatatagcat aaatcaatat tgactattgg ccattgcata cgttgtatcc 12 0 
atatcataat atgtacattt atattggctc atgtccaata tgaccgccat gttgacattg 18 0 
attattgact agttattaat agtaatcaat tacagggtca ttagttcata gcccatatat 24 0 
ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc 30 0 
ccgcccattg acgtcaataa cgacgtatgt tcccatagta acgctaatag ggactttcca 36 0 
ttgacgtcaa tgggaggagt atttacggta aactgcccac ttggcagtac atcaagtgta 42 0 
tcatatgcca agtacgcccc ccattgacgt caatgacggt aaatggcccg cctggcatta 48 0 
tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg tattagtcat 54 0 
cactattacc atggtgatgc ggttttggca gtacatcaat gggtgtggat agcggtttga 60 0 
ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt tttggcacca 660 
aaatcaacgg gactttccaa aatgtcgtaa taactccgcc ccattgacgc aaatgggcgg 720 
taggcgtgta cagtgggagg tctatataag cagagctcgt ttagtgaacc gtcagatcgc 780 
ctggagacgc catccacgct gttttgacct ccatagaaga caccgggacc gatccagcct 840 
ccgcggccgg gaacggtgca ttggaacgcg gatt 874 



M64944 

x (SEQ ID NO: 56) 

ggcacatggc caatgcatat cgatatatac 
agtcattggt tatatagcgt aaatcaatat 
atatcataat gtgtacattt atattggctc 
attattgact agttattaat agtaatcaat 
ggagttcccg cgttacataa cttacggtaa 
cccgcccatt gacgtcaata atgacgtgag 
attgacgtca atgggtggag tatttacggt 
atcatatgcc aagtacgccc cctattgacg 
atgcccagta catgacctta cgggactttc 
tcgctattac catggtgatg cggttttggc 
actcacgggg atttccaagt ctccacccca 
aaattcaacg ggactttcca aaatgtcgta 
gtaggcgtgt actatgggag gtctatataa 
cctggagacg ccatccacgc tgttttgacc 
tccgcggccg ggaacggtgc attggaacgc 



attgaatcaa tattggccat tagccatatt 60 

tggctaatgg ccatcgcata cgttgcatct 12 0 

atgtccaata tgaccgccat gttgacattg 18 0 

tacggggtca ttagttcata gcccatatat 24 0 

atggcccgcc tggctgaccg cccaacgacc 300 

ttcccatagt aacgccaata gggactttcc 360 

aaactgccca cttggcagta catcaagtgt 42 0 

tcaatgacgg taaatggccc gcctggcatt 4 80 

ctacttggca gtacatctac gtattagtca 540 

agtacatcaa tgggcgtgga tagcggtttg 600 

ttgacgtcaa tgggagtttg ttttggcacc 660 

ataactccgc cccattgacg caaatgggcg 720 

gcagagctcg tttagtgaac cgtcagatcg 780 

tccatagaag acaccgggac cgatccagcc 84 0 
ggattc 876 



Reference: J. Clin. Microbiol. 29, 2494-2502 (1991) 
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6. SV40 polyadenylation signal (late and early); J02400 
s (SEQ ID NO: 57) 

ggggatccag acatgataag atacattgat gagfctfcggac aaaccacaac tagaatgcag 60 
tgaaaaaaat gctttatttg tgaaatttgt gatgctattg ctttzatttgt aaccattata 12 0 
agctgcaata aacaagttaa caacaacaat tgcattcatt ttatgtttca ggttcagggg 180 
gaggtgtggg aggtttttta aagcaagtaa aacctctaca aatgtggtat ggctgattat 240 
gatcatgaac 250 



Reference: Proc. Natl. Acad. Sci. U.S.A. 78 (13, 100-104 (1981) 

7. Rabbit (3globin intron 2; JO0S0O 
s (SEQ ID NO: 58) 

ggatcctgag aacttcaggg tgagtttggg gacccttgat tgttctttct ttttcgctat 60 

tgtaaaattc atgttatatg gagggggcaa agttttcagg gtgttgttta gaatgggaag 12 0 

atgtcccttg tatcaccafcg gaccctcatg ataattttgt ttctttcact ttctactctg 180 

ttgacaacca ttgtctccfcc ttattttctt ttcattttct gtaacttttt cgttaaactt 240 

tagcttgcat ttgtaacgaa tttttaaatt cacttttgtt tatttgtcag attgtaagta 300 

ctttctctaa tcactttttt ttcaaggcaa tcagggtata tfcatattgta cttcagcaca 3 60 

gttttagaga acaattgtta taattaaatg ataaggtaga atatttctgc atataaattc 420 

tggctggcgt ggaaatattc ttattggtag aaacaactac accctggtca tcatcctgcc 480 

tttctcttta tggttacaat gatatacact gtttgagatg aggataaaat actctgagtc 540 

caaaccgggc ccctctgcta accatgttca tgccttcttc tctttcctac agctcctggg 600 
caacgtgctg 610 



References: Cell 10, 549-558 (1977); Cell 18, 1285-1297 (1979) 

8. Minimal synthetic rabbit pglobin polyadenylation signal 
s {SEQ ID NO? 59) 

aataaaagat ccagagctct agagatctgt gtgttggttt tttgtgtg 48 

Reference: Genes and Development 3: 1019-102 5, 1989 
V. IL-18 sequences to claim 

1. Mature consensus human IL-18 linked to an HC signal sequence, with 
intron included and underlined. Bold areas are from the KC signal 
sequence, and the unbolded are the linked mature human IL-18 
sequence 

s (SEQ ID NO:60) 

ATGGGGTCAACCGCCATCCTCGGCCTCCTCCTGGCTGTTCTCCAAG GTCAGTCCTGCCGAGGTCTTGAGG 
TCACAGAGGAGAACGGGTGGAAAGGAGCCCCTGATTCAAATTTTGTGTCTCCCCCACAG GAGTCTGTGCC 

tacttt ggcaagctt gaatctaaat tatcagtcat aagaaatttg aatgaccaag 
ttctcttcat tgaccaagga aatcggcctc tatttgaaga tatgactgat tctgactgta 
gagataatgc accccggacc atattfcatta taagtatgta fcaaagatagc cagcctagag 
gtatggctgt aactatctct gtgaagtgtg agaaaatttc aactctctcc tgtgagaaca 
aaattatttc ctttaaggaa atgaatcctc ctgataacat caaggataca aaaagtgaca 
tcatattctt tcagagaagt gtcccaggac atgataataa gatgcaattt gaatcttcafc 
catacgaagg atactttcta gcttgtgaaa aagagagaga cctttttaaa ctcattttga 
aaaaagagga tgaattgggg gatagatcta taatgttcac tgttcaaaac gaagactag 



t (SEQ ZD NO: 61) 

atggggtcaa ccgcca^tcct cggcctcctc 
ggtcttgagg tcacagagga gaacgggtgg 
cccccacagg agtctgtgcc tactttggca 
atttgaatga ccaagttctc ttcattgacc 



ctggctgttc tccaaggtca gtcctgccga 60 
aaaggagccc ctgattcaaa ttttgtgtct 120 
agcttgaatc taaattatca gtcataagaa 180 
aaggaaatcg gcctctattt gaagatatga 240 

50 



WO 03/031569 



PCT/US02/29640 



ctgattctga ctgtagagat aatgcacccc 
atagccagcc tagaggtatg gctgtaacta 
tctcctgtga gaacaaaatt atttccttta 
atacaaaaag tgacatcata ttctttcaga 
aatttgaatc ttcatcatac gaaggatact 
ttaaactcat tttgaaaaaa gaggatgaat 
aaaacgaaga ctag 



ggaccatatt tattataagt atgtataaag 300 
tctctgtgaa gtgtgagaaa atttcaactc 360 
aggaaatgaa tcctcctgat aacatcaagg 42 0 
gaagtgtccc aggacatgat aataagatgc 480 
ttctagcttg tgaaaaagag agagaccttt 54 0 
tgggggatag atctataatg ttcactgttc 600 

614 



s (SEQ ID NO: 62) 

ATGG GGTC AACCGCCATCCTCG GCCTCCTCCTGGCTGTTCTCCAAG GTCAGTCCTGCC 

GAGGTCTTGAGGTCACAGAGGAGAACGGGTGGAAAGGAGCCCC^ 

GTGTCTCCCCCACAGGAGTCTGTGCC 



atggggtcaa ccgccatcct cggcctcctc ctggctgttc tccaaggtca gtcctgccga 60 
ggtcttgagg tcacagagga gaacgggtgg aaaggagccc ctgattcaaa ttttgtgtct 12 0 
cccccacagg agtctgtgcc 14 0 

2 (SEQ ID NO: 63) 

tacttt ggcaagctt gaatctaaat tatcagtcat aagaaatttg aatgaccaag ttctcttcat tgaccaagga aatcggcctc 
tatttgaaga tatgactgat tctgactgta gagataatgc accccggacc atatttatta taagtatgta taaagatagc cagcctagag 
gtatggctgt aactatctct gtgaagtgtg agaaaattte aactctctscc tgtgagaaca aaattatttc ctttaaggaa atgaatcctc 
ctgataacat caaggataca aaaagtgaca tcatattctt tcagagaagt gtcccaggac atgataataa gatgcaattt gaatcttcat 
catacgaagg atactttcta gcttgtgaaa aagagagaga cctttttaaa ctcattttga aaaaagagga tgaattgggg gatagatcta 
taatgttcac tgttcaaaac gaagactag 



tactttggca agcttgaatc taaattatca gtcataagaa atttgaatga ccaagttctc 60 
ttcattgacc aaggaaatcg gcctctattt gaagatatga ctgattctga ctgtagagat 120 
aatgcacccc ggaccatatt tattataagt atgtataaag atagccagcc tagaggtatg 180 
gctgtaacta tctctgtgaa gtgtgagaaa atttcaactc tctcctgtga gaacaaaatt 24 0 
atttccttta aggaaatgaa tcctcctgat aacatcaagg atacaaaaag tgacatcata 3 00 
ttctttcaga gaagtgtccc aggacatgat aataagatgc aatttgaatc ttcatcatac 3 60 
gaaggatact ttctagcttg tgaaaaagag agagaccttt ttaaactcat tttgaaaaaa 42 0 
gaggatgaat tgggggatag atctataatg ttcactgttc aaaacgaaga ctag 4 74 



i (SEQ XD NO: 64) 

MGSTAI LGLLLAVLQGVCA 

Met Gly Ser Thr Ala lie Leu Gly Leu Leu Leu Ala Val Leu Gin Gly 
1 5 10 15 

Val Cys Ala 

: (SEQ XD NO:65) 

YFGKLESKLSVIRNLNDQVLFXDQGNRPLFEDMTDSDCRDNAPRTIFIISMYKDSQPRGMAVTISVKCEK 
I S TLSCENKI I S FKEMNPPDN IKDTKSD I IFFQRS VPGHDNKMQFESS S YEG YFLACEKERDLFKLI LKK 
EDELGDRSIMFTVQNED 

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val lie Arg Asn Leu Asn 
15 10 15 

Asp Gin Val Leu Phe lie Asp Gin Gly Asn Arg Pro Leu Phe Glu Asp 

20 25 30 

Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr lie Phe lie 
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35 40 45 

lie Ser Met Tyr Lys Asp Ser Gin Pro Arg Gly Met Ala Val Thr lie 
50 55 60 

Ser Val Lys Cys Glu Lys lie Ser Thr Leu Ser Cys Glu Asn Lys lie 
65 70 75 80 

lie Ser Phe Lys Glu Met Asn Pro Pro Asp Asn lie Lys Asp Thr Lys 

85 90 95 

Ser Asp He He Phe Phe Gin Arg Ser Val Pro Gly His Asp Asn Lys 

100 105 110 

Met Gin Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu 
115 120 125 

Lys Glu Arg Asp Leu Phe Lys Leu He Leu Lys Lys Glu Asp Glu Leu 
130 135 140 

Gly Asp Arg Ser He Met Phe Thr Val Gin Asn Glu Asp 
145 150 155 

2. Mature consensus human IL-18 linked to a human LC signal sequence, 
with no intron. Bold areas are from the LC signal sequence, and the 
unbolded are the linked mature human IL-1S sequence* 

: (SEQ XD NO: 66) 

ATGGCCTGGACCGTTCTCCTCCTCGGCCTCCTCTCTCACTGCACAGGCTCTGTGACCTCC t ac 1 1 1 
ggcaagctt gaatctaaat tatcagtcat aagaaatttg aatgaccaag ttctcttcat 
tgaccaagga aatcggcctc tatttgaaga tatgactgat tctgactgta gagataatgc 
accccggacc atatttatta taagtatgta taaagatagc cagcctagag gtatggctgt 
aactatctct gtgaagtgtg agaaaatttc aactctctcc tgtgagaaca aaattatttc 
ctttaaggaa atgaatcctc ctgataacat caaggataca aaaagtgaca tcatattctt 
tcagagaagt gtcccaggac atgataataa gatgcaattt gaatcttcat catacgaagg 
atactttcta gcttgtgaaa aagagagaga cctttttaaa ctcattttga aaaaagagga 
tgaattgggg gatagatcta taatgttcac tgttcaaaac gaagactag 

atggcctgga ccgttctcct cctcggcctc ctctctcact gcacaggctc tgtgacctcc 6 0 
tactttggca agcttgaatc taaattatca gtcataagaa atttgaatga ccaagttctc 120 
ttcattgacc aaggaaatcg gcctctattt gaagatatga ctgattctga ctgtagagat 180 
aatgcacccc ggaccatatt tattataagt atgtataaag atagccagcc tagaggtatg 24 0 
gctgtaacta tctctgtgaa gtgtgagaaa atttcaactc tctcctgtga gaacaaaatt 3 00 
atttccttta aggaaatgaa tcctcctgat aacatcaagg atacaaaaag tgacatcata 3 60 
ttctttcaga gaagtgtccc aggacatgat aataagatgc aatttgaatc ttcatcatac 42 0 
gaaggatact ttctagcttg tgaaaaagag agagaccttt ttaaactcat tttgaaaaaa 4 80 
gaggatgaat tgggggatag atctataatg ttcactgttc aaaacgaaga ctag 534 

: {SEQ ID NO: 67) 

ATGGCCTGGACCGTTCTCCTCCTCGGCCTCCTCTCTCACTGCACAGGCTCTGTGACCTCC 

atggcctgga ccgttctcct cctcggcctc ctctctcact gcacaggctc tgtgacctcc 60 



: (SEQ ID NO: 68) 

tacttt ggcaagctt gaatctaaat tatcagtcat aagaaatttg aatgaccaag 
ttctcttcat tgaccaagga aatcggcctc tatttgaaga tatgactgat 
tctgactgta gagataatgc accccggacc atatttatta taagtatgta 
taaagatagc cagcctagag gtatggctgt aactatctct gtgaagtgtg 
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agaaaatttc 
atgaatcctc 
tcagagaagt 
catacgaagg 
ctcattttga 
tgttcaaaac 



aactctctcc 
ctgataacat 
gtcccaggac 
atactttcta 
aaaaagagga 
gaagactag 



tgtgagaaca 
caaggataca 
atgataataa 
gcttgtgaaa 
tgaattgggg 



aaattatttc 
aaaagtgaca 
gatgcaattt 
aagagagaga 
gatagatcta 



ctttaaggaa 
tcatattctt 
gaatcttcat 
cctttttaaa 
taatgttcac 



tactttggca agcttgaatc taaattatca gtcataagaa atttgaatga ccaagttctc 60 
ttcattgacc aaggaaatcg gcctctattt gaagatatga ctgattctga ctgtagagat 120 
aatgcacccc ggaccatatt tattataagt atgtataaag atagccagcc tagaggtatg 180 
gctgtaacta tctctgtgaa gtgtgagaaa atttcaactc tctcctgtga gaacaaaatt 240 
atfctcctfcta aggaaatgaa tcctcctgat aacatcaagg atacaaaaag tgacatcata 300 
ttctttcaga gaagtgtccc aggacatgat aataagatgc aatttgaatc ttcatcatac 360 
gaaggatact ttctagcttg tgaaaaagag agagaccttfc ttaaactcat tttgaaaaaa 420 
gaggatgaat tgggggatag atctataatg ttcactgttc aaaacgaaga ctag 474 

; (SEQ ID NO: 69) 

mWTVLLLGLLSHCTGSVTSYFGKLESKLSVlRNLND^ 

M YKDS QPRGMAVTISVKCEKI S TLS CENKI ISFKEMNPPDNI KDTKSDI I FFQRS VPGHDNKMQFES SS Y 
EGYFLACEKERDLFKDILKKEDELGDRSIMFTVQNED 



Met Ala Trp Thr 

1 

Ser Val Thr Ser 

20 

Arg Asn Leu Asn 
35 

Leu Phe Glu Asp 
50 

Thr lie Phe lie 
65 

Ala Val Thr lie 



Glu Asn Lys lie 

100 



Val Leu Leu Leu 
5 

Tyr Phe Gly Lys 



Asp Gin Val Leu 

40 

Met Thr Asp Ser 
55 

lie Ser Met Tyr 
70 

Ser Val Lys Cys 
85 

lie Ser Phe Lys 



Gly Leu Leu Ser 

10 

Leu Glu Ser Lys 
25 

Phe lie Asp Gin 



Asp Cys Arg Asp 

60 

Lys Asp Ser Gin 
75 

Glu Lys He Ser 
90 

Glu Met Asn Pro 
105 



His Cys Thr Gly 
15 

Leu Ser Val He 
30 

Gly Asn Arg Pro 
45 

Asn Ala Pro Arg 



Pro Arg Gly Met 

SO 

Thr Leu Ser Cys 
95 

Pro Asp Asn He 
110 



Lys Asp Thr Lys 
115 

His Asp Asn Lys 
13 0 

Leu Ala Cys Glu 
145 

Glu Asp Glu Leu 



Ser Asp He He 

120 

Met Gin Phe Glu 
135 

Lys Glu Arg Asp 

150 

Gly Asp Arg Ser 
165 



Phe Phe Gin Arg 



Ser Ser Ser Tyr 

14 0 

Leu Phe Lys Leu 
155 

He Met Phe Thr 
170 



Ser Val Pro Gly 
12 5 

Glu Gly Tyr Phe 



He Leu Lys Lys 

160 

Val Gin Asn Glu 
175 



Asp 



: (SEQ ID NOs70) 
MAWTVLLLGLLSHCTGSVTS 
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Met Ala Trp Thr Val Leu Leu Leu Gly Leu Leu Ser His Cys Thr Gly 
15 10 15 

Ser Val Thr Ser 

20 



: (SEQ ID NOi7l) 

YFGKLESKLSVIRH^NDQVLFIDQGNRPLFED^^ 

TSTLSCEl^IISFKEMNPPDNlKDTKSDIlFFQRSVPGmDNKMQFESSSYEGYFLACEKERDLFKLXLK^ 
EDELGDRS IMFTVQNED 

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val lie Arg Asn Leu Asn 
15 10 15 

Asp Gin Val Leu Phe lie Asp Gin Gly Asn Arg Pro Leu Phe Glu Asp 

20 25 30 

Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr lie Phe lie 
35 40 45 

lie Ser Met Tyr Lys Asp Ser Gin Pro Arg Gly Met Ala Val Thr lie 
50 55 60 

Ser Val Lys Cys Glu Lys lie Ser Thr Leu Ser Cys Glu Asn Lys lie 

65 70 75 

lie Ser Phe Lys Glu Met Asn Pro Pro Asp Asn lie Lys Asp Thr Lys 

85 90 95 

Ser Asp He He Phe Phe Gin Arg Ser Val Pro Gly His Asp Asn Lys 

100 105 110 

Met Gin Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu 
115 120 125 

Lys Glu Arg Asp Leu Phe Lys Leu He Leu Lys Lys Glu Asp Glu Leu 
130 135 140 

Gly Asp Arg Ser He Met Phe Thr Val Gin Asn Glu Asp 

145 150 

Several changes could be made in EL- 18, e.g., as presented herein. Changes in non-surface 
exposed residues that could be made that would result in the high probability of retention of IL- 
18 activity with no changes in iinmunogenicity are: 

Thr 10 for Ser 10 
Val 12 for He 12 
Ser 45 for Thr 45 
Tyr 47 for Phe 47 
Phe 52 for Tyr 52 
Val 64 for lie 64 
Tyr 101 for Phe 101 
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These compounds would be useful as IL-18 agonists, for raising anti-IL-18 antibodies, for 
assays for IL-18 or IL-18 binding proteins and for preparation of affinity columns for the 
purification of IL-18 binding proteins. 

Changes in amino acids with a low percentage of surface exposure that could be made that 
would result in the high probability of retention of JL-1 8 activity with possible changes in 
immunogenicity are: 

Val s forLeu 5 
Val 20 for Leu 20 
He 20 for Leu 20 
Tyr 21 for Phe 21 
Val 22 for lie 22 
lie 66 for Val 65 
Thr 72 for Ser 72 
Phe 148 for Ser 148 

These compounds would be useful as IL-1 8 agonists, for raising anti-EL-1 8 antibodies, for 
assays for 1-18 or IL-18 binding proteins and for preparation of affinity columns for the 
purification of IL-18 binding proteins. 

Changes that could be made in amino acids involved in receptor contact that would result in 
alteration of IL-1 8 activity by either increasing or decreasing binding of the IL-18 analog to the 
IL-1 8 receptor are: 

Glu 4 for Lys 4 
He 6 for Glu 6 
Asp 8 for Lys s 
lie 13 for Arg 13 
Arg 15 for Leu 15 
Lys 17 for Asp 17 
Lys 27 for Arg 27 
Ala 30 for Phe 30 
Lys 35 for Asp 3S 
Phe 37 for Asp 37 
Glu 38 for Cys 38 
Ala 39 for Arg 39 
Trp 40 for Asp 40 
Glu S! forMet 51 
Gly 53 for Lys 53 
He 56 for Gin 56 
Ala 58 for Arg 58 
Lys 62 for Val 62 
Lys 94 for Asp 94 
Phe 95 for Thr 95 
Leu 104 for Arg 104 
He 108 for Gly 108 
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Lys 111 for Asn nl 
Phe 129 for Lys 129 
Asp ,31 forArg 131 
Leu 132 for Asp 132 
Glu ,33 forLeu 133 
Ala 134 for Phe 134 
Thr ,50 forMet 150 
Ser 151 for Phe 151 

Depending on the alteration of receptor binding or receptor activity, these compounds would be 
useful as IL-18 agonists or antagonists, for preparation of antibodies against IL-18, in assays 
for IL-18 or IL-1 8 binding proteins and the preparation of affinity columns for the purification 
of IL-1 8 binding proteins. 



3. Other claimed changes in mature human IL-18 protein sequence: 

a. Human sequence reference AF380360-1, linked to either signal 
sequence listed above, with the following sequence of mature 
human IL-18; this appears to be a natural variant of human IL- 
18, with changes in blue, 
i {SEQ ID NO: 72) 

YFGKLESK LSVIRKI^NQVLFIDQGNRPLFBDMTDSDCRDNAPRTIFIISMYKDSQPRGMAVTISV 
KCEKI S TLSCENKI I S FKEVNPPDNI KDTKSD 1 1 FFQRS VPGHDNKMQFESS SYEGYF 
LTCEKERDLFKLI LKKEDELGDRS IMFTVQNED 

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val lie Arg Asn Leu Asn 
15 10 15 

Asn Gin Val Leu Phe lie Asp Gin Gly Asn Arg Pro Leu Phe Glu Asp 

20 25 30 

Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr lie Phe He 
35 40 45 

lie Ser Met Tyr Lys Asp Ser Gin Pro Arg Gly Met Ala Val Thr He 
50 55 60 

Ser Val Lys Cys Glu Lys He Ser Thr Leu Ser Cys Glu Asn Lys He 
65 70 75 80 

lie Ser Phe Lys Glu Val Asn Pro Pro Asp Asn He Lys Asp Thr Lys 

85 90 95 

Ser Asp lie He Phe Phe Gin Arg Ser Val Pro Gly His Asp Asn Lys 

100 105 110 

Met Gin Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Thr Cys Glu 
115 120 125 

Lys Glu Arg Asp Leu Phe Lys Leu He Leu Lys Lys Glu Asp Glu Leu 
130 135 140 

Gly Asp Arg Ser He Met Phe Thr Val Gin Asn Glu Asp 
145 150 155 
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(SEQ ID NO: 73) 

tactttggca agcttgaatc taaattatca gtcataagaa atttgaataa ccaagttctc 60 
ttcattgacc aaggaaatcg gcctctattt gaagatatga ctgattctga ctgtagagat 120 
aatgcacccc ggaceatatt tattataagt atgtataaag atagccagcc tagaggtatg 180 
gctgtaacta tctctgtgaa gtgtgagaaa atttcaactc tctcctgtga gaacaaaatt 240 
atttccttta aggaagtgaa tcctcctgat aacatcaagg atacaaaaag tgacatcata 300 
ttctttcaga gaagtgtccc aggacatgat aataagatgc aatttgaatc ttcatcatac 360 
gaaggatact ttctaacttg tgaaaaagag agagaccttt ttaaactcat tttgaaaaaa 42 0 
gaggatgaat tgggggatag atctataatg ttcactgttc aaaacgaaga ctag 474 



b. Human sequence reference AAC27787 ; this appears to be a natural 
variant of human IL-18. Only mature human XL- 18 protein is 
shown, DNA sequence is not available from database; 
y f gkl e sk 1 s v i rnl ndqv 1 f idqgnrp 1 ledmt dsdcrdnapr t i f i i rmykds qprgmavt i svkcek 
i s 1 1 s cenki i s f kemnppdnikdt kad i i f f qrsvpghdnkmqf es s syegyf lacekerdl f kli lkk 
edelgdrs imf tvqsed 

(SEQ ID NOs74) 

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val lie Arg Asn Leu Asn 
15 10 15 

Asp Gin Val Leu Phe lie Asp Gin Gly Asn Arg Pro Leu Leu Glu Asp 

20 25 30 

Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr lie Phe lie 
35 40 45 

He Arg Met Tyr Lys Asp Ser Gin pro Arg Gly Met Ala Val Thr lie 
50 55 60 

Ser Val Lys Cys Glu Lys He Ser Thr Leu Ser Cys Glu Asn Lys He 
65 70 75 80 

He Ser Phe Lys Glu Met Asn Pro Pro Asp Asn He Lys Asp Thr Lys 

85 90 95 

Ser Asp He He Phe Phe Gin Arg Ser Val Pro Gly His Asp Asn Lys 

100 105 110 

Met Gin Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu 
115 120 125 

Lys Glu Arg Asp Leu Phe Lys Leu lie Leu Lys Lys Glu Asp Glu Leu 
130 135 140 

Gly Asp Arg Ser He Met Phe Thr Val Gin Ser Glu Asp 
145 150 155 



c. Macaque sequence reference AF303732; mature macaque protein and 
DNA sequences are shown, and would be linked to either signal 
sequence shown above. Blue residues are altered from human 
consensus sequence: 
(SEQ ID NOs 75) 
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YFGKI+ESKLSXIRNLNDQVLFIDQGlSn^PLFED^ 

KCEKISTLS CENRI I S FKEMNPPDNIKDTKSD 1 1 FFQRS VPGHDKKMQFES S S YEG YFL ACE KERDL YKL 
ILKKKDELGDRS IMFTVQNED 

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser lie lie Arg Asn Leu Asn 
15 10 15 

Asp Gin Val Leu Phe lie Asp Gin Gly Asn Arg Pro Leu Phe Glu Asp 

20 25 30 

Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr lie Phe lie 
35 40 45 

lie Asn Met Tyr Lys Asp Ser Gin Pro Arg Gly Met Ala Val Ala lie 
50 55 60 

Ser Val Lys Cys Glu Lys lie Ser Thr Leu Ser Cys Glu Asn Arg lie 
65 70 75 80 

lie Ser Phe Lys Glu Met Asn Pro Pro Asp Asn lie Lys Asp Thr Lys 

85 90 95 

Ser Asp He He Phe Phe Gin Arg Ser Val Pro Gly His Asp Asn Lys 

100 105 110 

Met Gin Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu 
115 120 125 

Lys Glu Arg Asp Leu Tyr Lys Leu He Leu Lys Lys Lys Asp Glu Leu 
130 135 140 

Gly Asp Arg Ser He Met Phe Thr Val Gin Asn Glu Asp 
145 150 155 

(SSQ ID NOr76) 

tactttggca agcttgaatc taaattatca atcataagaa atttgaatga ccaagttctc 6 0 

ttcattgacc aaggaaatcg gcccctattt gaagatatga ctgattctga ctgtagagat 120 

aatgcacccc ggaccatatt tattataaat atgtataaag atagccagcc tagaggtatg 180 

gctgtagcca tctctgtgaa atgtgagaaa atttcaactc tctcctgtga gaacagaatt 240 

atttccttta aggaaatgaa tcctcctgat aacatcaagg atacgaaaag tgacatcata 3 00 

ttctttcaga gaagtgtccc aggacatgat aataagatgc aatttgaatc ttcatcatac 360 

gaaggatact ttctagcttg tgaaaaagag agagaccttt ataaactcat tttgaaaaaa 42 0 

aaggatgaat tgggggatag atctataatg ttcactgttc aaaacgaaga ctag 474 



Reference: J" Interferon Cytokine Research 21*173-180, 2001, LD 

Giavedonl et al 

d. Mutant human IL-18 with increased II*- 18 activity and reduced 

ability to be inhibited by IL-18 binding protein; mature human IL- 
18 sequence with two altered residues indicated in blue; 
(SEQ ID NO:77) 

YFGKLAS KLSVI RNLNDQVL F IDQGNRPLFEDMTDSDCRDNAPRTI FI I SM YADSQPRGMAVTISVKCEK 
ISTDSCENKIISFKEMNPPDNIKDTKSDIIFFQRSVPGHDNKMQFESSSYEGYFI±ACEKERDLFKIjILKK 
EDELGDRSIMFTVQNED 
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Tyr Plie Gly Lys Leu Ala Ser Lys Leu Ser Val He Arg Asn Leu Asn 
15 10 15 

Asp Gin Val Leu Phe He Asp Gin Gly Asn Arg Pro Leu Phe Glu Asp 

20 25 30 

Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr He Phe He 
35 40 45 

He Ser Met Tyr Ala Asp Ser Gin Pro Arg Gly Met Ala Val Thr He 
50 55 60 

Ser Val Lys Cys Glu Lys He Ser Thr Leu Ser Cys Glu Asn Lys He 
65 70 75 80 

lie Ser Phe Lys Glu Met Asn Pro Pro Asp Asn He Lys Asp Thr Lys 

85 90 95 

Ser Asp He lie Phe Phe Gin Arg Ser Val Pro Gly His Asp Asn Lys 

100 105 110 

Met Gin Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu 
115 120 125 

Lys Glu Arg Asp Leu Phe Lys Leu He Leu Lys Lys Glu Asp Glu Leu 
130 135 140 

Gly Asp Arg Ser He Met Phe Thr Val Gin Asn Glu Asp 
145 150 155 

Reference: PNAS 98:3304-3309, 2001 SM Kim et al. 

Accordingly, based on the above non-limiting examples of specific substitutions, 
alternative substitutions can be made by routine experimentation, to provide alternative 
tumor/adjuvant vaccines of the present invention, e.g., by making one or more 
substitutions, insertions or deletions in proteins or tumor proteins which give rise to 
effective immune responses. 

Amino acid sequence variations in a tumor protein or cytokine of the present invention 
can be prepared e.g., by mutations in the DNA. Such tumor or cytokine variants 
include, for example, deletions, insertions or substitutions of nucleotides coding for 
different amino acid residues within the amino acid sequence. Obviously, mutations 
that will be made in nucleic acid encoding a tumor protein or cytokine must not place 
the sequence out of reading frame and preferably will not create complementary 
domains that could produce secondary mRNA structures (see, e.g., Ausubel (1995 
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rev.), infra; Sambrook (1989), infra). 

Tumor protein or cytokine-encoding nucleic acid of the present invention can also be 
prepared by amplification or site-directed mutagenesis of nucleotides in DNA or RNA 
encoding a tumor or cytokine protein or portion thereof, and thereafter synthesizing or 
reverse transcribing the encoding DNA to produce DNA or RNA encoding a tumor 
protein or cytokine variant (see, e.g., Ausubel (1995 rev.), infra; Sambrook (1989), 
infra), based on the teaching and guidance presented herein. 

Recombinant viruses expressing tumor/adjuvant proteins of the present invention, or 
nucleic acid vectors encoding therefor, include a finite set of tumor/adjuvant-encoding 
sequences as substitution nucleotides that can be routinely obtained by one of ordinary 
skill in the art, without undue experimentation, based on the teachings and guidance 
presented herein. For a detailed description of protein chemistry and structure, see 
Schulz, G. E. et aL, Principles of Protein Structure, Springer- Verlag, New York, N.Y. 
(1978), and Creighton, T. E., Proteins: Structure and Molecular Properties, W. IL 
Freeman & Co., San Francisco, Calif (1983), which are hereby incorporated by 
reference. For a presentation of nucleotide sequence substitutions, such as codon 
preferences, see Ausubel et al., eds, Current Protocols in Molecular Biology, Greene 
Publishing Assoc., New York, N.Y, (1987-2001) (hereinafter, "Ausubel et al, sections 
A, 1.1 -A. 1.24, and Sambrook, J. et aL, Molecular Cloning: A Laboratory Manual, 
Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 
(1989) at Appendices C and D. 

Thus, one of ordinary skill in the art, given the teachings and guidance presented 
herein, will know how to substitute other amino acid residues in other positions of an 
DNA or RNA to obtain alternative tumor/adjuvant vaccines, including substitutional, 
deletional or insertional variants. 

EXAMPLES 

Screening Assays for Tumor Activity 
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For screening anti-tumor activity of sera or cells from an individual immunized with a 
vaccine of the invention, any known and/or suitable screening assay can be used, as is 
known in the art. 

Specific Embodiment: Recombinant Vaccinia Virus Encoding tumor/adjuvant's, 
Nucleic acid vaccines and Methods of Making and Using Thereof 

Overview. A suitable recombinant viral vector is used according to the present 
invention for expressing tumor proteins (e.g., MUC-1, PSA, KLK3 or any portion, 
variant or combination thereof) to provide at least a portion of a vaccine useful for the 
production, testing or use of a tumor vaccine of the present invention that induces at 
least one of a humoral or cellular immune response against the tumor, a portion thereof 
or a cell thereof, as well as for analyses of B -cell and CTL determinants, 

A tumor vaccine of the present invention expresses at least one tumor nucleic acid or 
protein (tumor/adjuvant) and at least one adjuvant nucleic acid or protein. The tumor 
vaccine functionally encodes at least one tumor/adjuvant or adjuvant. Multiple, distinct 
fragments or plasmids encoding tumor/adjuvant and/or adjuvant (e.g., IL-18) can be 
prepared by substituting one tumor/adjuvant encoding sequence with another, e.g., 
using a restriction fragment or mutagenesis, according to known methods (see, e.g., 
Ausubel or Sambrook, supra). 

Preparation of Tumor Vaccine, Methods for the preparation of individual plasmids 
(each expressing at least one unique tumor or adjuvant protein sequence) can utilize 
DNA or RNA amplification for the substitution of isolated protein variant sequences 
into a vector , which vector encodes a known tumor and/or adjuvant protein sequence, 
as known in the art. 

Methods of amplification of RNA or DNA are well known in the art and can be used 
according to the present invention without undue experimentation, based on the 
teaching and guidance presented herein. Known methods of DNA or RNA 
amplification include, but are not limited to polymerase chain reaction (PCR) and 
related amplification processes (see, e.g., U.S. Pat. Nos. 4,683,195,4,683,202, 
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4,800,159, 4,965,188, to Mullis et aL; U.S. Pat Nos. 4,795,699 and 4,921,794 to 
Tabor et al; U.S. Pat. No. 5,142,033 to Innis; U.S. Pat No. 5,122,464 to Wilson et 
al.;U.S. Pat. No. 5,091,310 to Innis; U.S. Pat. No, 5,066,584 to Gyllensten et al; 
U.S. Pat No. 4,889,818 to Gelfand et al; U.S. Pat No. 4,994,370 to Silver et al; 
U.S. Pat No. 4,766,067 to Biswas; U.S. Pat No. 4,656,134 to Ringold) and RNA 
mediated amplification which uses anti-sense RNA to the target sequence as a template 
for double stranded DNA synthesis (U.S. Pat. No. 5, 1 30,238 to Malek et al, with the 
trade name NASBA), the entire contents of which patents are herein entirely 
incorporated by reference. 

For example, recombinant tumor vaccine constructs prepared by this route can be used 
for immunizations and elicitation of tumor-specific T and/or B-cell responses. Primers 
utilize conserved tumor sequences and thus successfully amplify genes from many 
diverse tumor patient or cell samples or from tumor nucleic acid libraries, as non- 
limiting examples. The basic techniques described here can similarly be used with 
PGR or other types of amplification primers, in order to substitute smaller or larger 
pieces of the sequence from field isolates for that found in vectors encoding a tumor 
protein. See, e.g., Ausubel; supra, Sambrook, supra. 

Tumor/ Adjuvant Encoding Nucleic Acids. The technique can use, as a non-limiting 
example, the isolation of DNA from tumor infected cells and the amplification of 
sequences by PCR> PCR or other amplification products provide the simplest means 
for the isolation of tumor sequences, but any other suitable and known methods can be 
used such as cloning and isolation of tumor/adjuvant encoding nucleic acid or proteins 
(see Ausubel, infra; Sambrook, infra). Enzyme restriction sites are preferably 
incorporated into PCR or other amplification primer sequences to facilitate gene 
cloning* 

Isolated DNA for PCR can be prepared from multiple tumor or adjuvant sources, 
inclusive of fresh or frozen whole blood or tumor tissue or cells from tumor+ patients 
and cells that have been infected in vitro with tumor virus isolates. 
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In order to produce new tumor/adjuvant constructs, the polymerase chain reaction 
(PGR) is preferably used to amplify 100-2700 base pairs (bp) of a tumor protein 
encoding nucleic acid from each different tumor patient, tissue or cell sample. The 
PCR primers can represent well-conserved tumor sequences which are suitable for 
amplifying genes from known samples of genes, isolated tumors or diverse tumor 
patient samples. The amplified DNA preferably comprises a portion encoding 10-900 
(such as 100-400, 400-600 or 600-900, or any range or value therein) amino acids of a 
PSA, MUC-1 or KLK-3 protein. Preferably, most or all of the entire gene is amplified. 
Optionally, the MUC-1 encoding sequence amplified is missing part or all of sequences 
encoding the 20 amino acid repeat or any combination or number of copies thereof, 
such but not limited, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 
21, 22, 23, 24, 25, 26, 27, 28, 29 or 30 copies or any fraction thereof, such .1, .2, .3, A, 
.5, .6, .7, .8, S of the encoding nucleic acid repeat, or 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 
13, 14, 15, 16, 17, 18, 19, 20 amino acids or any combination thereof Non-limiting 
examples include 1, 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5, 10, 
10,5, 11, 11.5, 12, 12.5, 13, 13.5, 14, 14.5, 15, 15.5, 16, 16.5, 17, 17.5, 18, 18,5, and the 
like, including any fractional amount thereof, such as .1, .2, and the like. 

The PCR primers can be designed so that restriction enzyme sites flank the tumor 
protein or cytokine adjuvant gene sequence in a suitable expression plasmid or vector, 
such that they are incorporated into the amplified DNA products. Suitable host cells 
can then be transformed with the tumor/adjuvant plasmid(s) via any of a number of 
methods well-known in the art, including, e.g., electroporation, and recombinant 
colonies are picked and examined by sequencing. 

Methods for the production of expression vectors are well-known in the art (see, e.g., 
Mackett,M etal„Proc. Natl. Acad. ScL (USA) 79:7415-7419 (1982); Panicali, D., 
and Paoletti, E. a Proc. Natl. Acad. Sci, (USA) 79:4927-4931 (1982); U.S, Pat No. 
4,169,763; Mazzara, G. P. et aL, Methods in Enz, 217:557-581 (1993), Ausubel et al., 
infra, (e.g., 16.15-16.19), each of which are entirely incorporated herein by reference). 
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For use in the present invention a nucleic acid vaccine or a viral vector vaccine can be 
either used alone, in combination or sequentially. 

As a non-limiting example of a suitable viral vector for a tumor vaccine of the present 
invention, vaccinia virus has a number of useful characteristics, including capacity that 
permits cloning large fragments of foreign DNA (greater than 20 Kb), retention of 
infectivity after insertion of foreign DNA, a wide host range, a relatively high level of 
protein synthesis, and suitable transport, secretion, processing and post-translational 
modifications as dictated by the primary structure of the expressed protein and the host 
cell type use. For example, N-Oglycosylation, phosphorylation, myristylation, and 
cleavage, as well as assembly of expressed proteins, occur in a faithful manner. 

Several variations of the vaccinia vector have been developed and are suitable for use 
in the present invention (e.g., see Ausubel et al., infra, sec. 16.15-16.19). Most 
commonly, after obtaining the virus stock (Ausubel, infra at sec, 16.16), a nucleic acid 
sequence encoding a tumor/adjuvant is placed under control of a vaccinia virus 
promoter and integrated into the genome of vaccinia so as to retain infectivity (Ausubel 
et al., infra at sec. 16.17). Alternatively, expression can be achieved by transfecting a 
plasmid containing the vaccinia promoter-controlled gene encoding a tumor/adjuvant 
into a cell that has been infected with wild-type vaccinia. 

Preferably, the host cell and vector are suitable and approved for use in vaccination of 
mammals and humans. These recombinant vectors are then characterized using various 

r 

known methods (Ausubel et al., infra at sec. 16.18). In still another variation, the 
bacteria phage T7 RNA polymerase chain can be integrated into the genome of the 
vector so that the tumor/adjuvant encoding sequences will be expressed under the 
control of a T7 promoter, either in transfected plasma, plasmid or a recombinant 
vaccinia virus, will be expressed. 

The use of pox virus promoters is preferred for vaccinia expression because cellular 

and other viral promoters are not usually recognized by the vaccinia transcriptional 

apparatus. A compound early/late promoter is preferably used in recombinant vaccinia 

for nucleic acid vaccines, as it is desirable to express the tumor/adjuvant as an antigen 
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that is presented in recombinant vaccinia virus infected host cell in association with 
major histocompatibility class (MHC) I or EL Such MHC associated tumor protein will 
then form cytotoxic T cell targets, and prime vaccinated mammals for a cytotoxic T cell 
response and/or a humoral response against the expressed tumor tumor/adjuvants. This 
is because the ability of vaccinia viral vectors to induce MHC presentation in host cells 
for this type of antigen appears to diminish late in the infection stage. Transcripts 
originating early will terminate after the sequence TTTTTNT and lead to inadequate 
MHC presentation. 

Alternatively, any such termination motifs within the coding sequence of the gene can 
be altered by mutagenesis if an early pox virus promoter is used, in order to enhance 
MHC presentation of protein antigens in host cells {Earl et al., infra* 1990). To mimic 
vaccinia virus mRNAs, untranslated leader and 3 -terminal sequences are usually kept 
short, if they are used in the vaccinia plasmids incorporating tumor/adjuvant encoding 
sequences. 

Preferably, the plasmid used for making vaccinia constructs according to the present 
invention has been designed with restriction endonuclease sites for insertion of the gene 
downstream of the vaccinia promoter (Ausubel et aK, infra, sec. 16,17). More 
preferably, the plasmid already contains an protein encoding sequence, wherein the 
restriction sites occur uniquely near each of the beginning and ends of the protein 
coding sequence. The same restriction fragment of the tumor/adjuvant encoding 
sequence can then replace the corresponding sequence in the plasmid. In such cases, 
the major portion of the tumor/adjuvant encoding sequence can be inserted after 
removing most or all of the protein encoding sequence from the plasmid. 

Preferably, the resulting vaccinia construct (containing the tumor/adjuvant encoding 
sequence and the vaccinia promoter) is flanked by vaccinia DNA to permit homologous 
recombination when the plasmid is transfected into cells that have been previously 
infected with wild-type vaccinia virus. The flanking vaccinia virus DNA is chosen so 
that the recombination will not interrupt an essential viral gene. 

Without selection, the ratio of recombinant to parental vaccinia virus is usually about 

65 



WO 03/031569 



FCT/US02/29640 



1 : 1 000. Although this frequency is high enough to permit the use of plaque 
hybridization (see Ausubel et al., infra at sec. 63 and 6,4) or immunoscreening 
(Ausubel et al., infra at sec, 6.7) to pick recombinant viruses, a variety of methods to 
facilitate recombinant-virus identification have been employed. Nonlimiting examples 
of such selection or screening techniques are known in the art (see Ausubel et al., infra 
at sec. 16. 1 7). Usually, the expression cassette is flanked by segments of the vaccinia 
thymidine kinase (TK) genes so that recombination results in inactivation of TK. Virus 
with a TK.sup.- phenotype can then be distinguished from those with a TK.sup.+ 
phenotype by infecting a TK.sup,- cell line in the presence of 5-bromo-deoxyuridine (5- 
BrdU), which must be phosphorylated by TK to be Iethally incorporated into the virus 
genome. Alternatively or additionally, recombinant viruses can be selected by the co- 
expression of a bacterial antibiotic resistant gene such as ampicillin (amp) or guanine 
phosphoribosyl transferase (gpt). As a further example, co-expression of the 
Escherichia coli lac Z gene allows co-screening of recombinant virus plaques with Xgal 
(Ausubel, infra, sec. 16.17). 

The recombinant vaccinia viruses expressing a tumor/adjuvant of the present invention 
can be optionally attenuated or inactivated according to known methods, such as by 
heat, paraformaldehyde treatment, ultraviolet irradiation, propriolactene treatment, 
hybrid or chimera formation or by other known methods (see, e.g., Zagury et al., Nature 
332:728-731 (1988); Ito et al., Cancer Res, 50:6915-6918 (1990); Wellis et al. s J. 
Immunol. 99:1134-9 (1967); D'Honcht, Vaccine 10 (Suppl.):548-52 (1992); Selenka et 
al., Arch. Hyg. BakterioL 153:244-253 (1969); Grundwald-Bearch et al., J. Cancer 
Res. Clin. Oncol. 1 17:561-567 (1991); the contents of which are entirely incorporated 
here by reference). For example, heat inactivation at 60.degree. C. will reduce virus 
titer considerably. Such attenuation techniques are safety tested, as incomplete 
inactivation might result in patient death (Dorozynski and Anderson, Science 252:501- 
502 (1991)). 

Such attenuated or inactivated recombinant vaccinia is to be used where the patient 
may have a compromised immune system as complications or death can occur when 
live vaccinia is administered. 

66 



WO 03/031569 



PCT/US02/29640 



Pharmaceutical Compositions 

Pharmaceutical preparations of the present invention, suitable for inoculation or for 
parenteral or oral administration, include a polyrecombinant virus vaccine comprising 
of at least 4, and up to about 10,000, preferably 4 to about 1000, and more preferably 
about 10 to about 100 different recombinant viruses, in the form of a cell lysate, 
membrane-bound fraction, partially purified, or purified form. Preferably, the nucleic 
acid vaccine comprises recombinant virus containing cell lysate (or membrane-bound 
fractions thereof) that further comprise tumor/adjuvant proteins already expressed by 
the recombinant viruses* The inclusion of the expressed tumor/adjuvants is now 
discovered to enhance the primary antibody response. 

The nucleic acid vaccine composition can be in the form of sterile aqueous or non- 
aqueous solutions, suspensions, or emulsions, and can also contain auxiliary agents or 
excipients which are known in the art. Each of the at least about 4-20 different viruses 
encode and express a different tumor/adjuvant, as presented herein, tumor/adjuvants 
encoding DNA can be selected to represent tumor/adjuvants suitable for treatment. For 
example, a vaccine could represent sequences from any or any combination of suitable 
tumors and adjuvant proteins. 

A nucleic acid vaccine composition can further comprise immunomodulators such as 
cytokines which accentuate an immune response to a viral infection. See, e.g., Berkow 
et aL, eds,, The Merck Manual, Fifteenth Edition, Merck and Co., Rahway, N J, 
(1987); Goodman et al., eds., Goodman and Gilman's The Pharmacological Basis of 
Therapeutics, Eighth Edition, Pergamon Press, Inc., Elmsford, N.Y* (1990); Avery's 
Drug Treatment: Principles and Practice of Clinical Pharmacology and Therapeutics, 
Third Edition, ADIS Press, LTD., Williams and Wilkins, Baltimore, Md (1987); and 
Katzung, ed. Basic and Clinical Pharmacology, Fifth Edition, Appleton and Lange, 
Norwalk, Conn. (1992), which references and references cited therein, are entirely 
incorporated herein by reference as they show the state of the art. 

As would be understood by one of ordinary skill in the art, when a nucleic acid vaccine 
of the present invention is provided to an individual, it can be in a composition which 
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can further comprise at least one of salts, buffers, adjuvants, or other substances which 
are desirable for improving the efficacy of the composition. Adjuvants are substances 
that can be used to specifically augment at least one immune response. Normally, the 
adjuvant and the composition are mixed prior to presentation to the immune system, or 
presented separately, but into the same site of the being immunized. Adjuvants can be 
loosely divided into several groups based upon their composition. These groups 
include oil adjuvants, mineral salts (for example, AlK(SO,sub.4)*sub.2, 
AlNa(SO.sub.4).sub.2, AlNH.sub.4 (SO.sub.4), silica, kaolin, and carbon), 
polynucleotides (for example, poly IC and poly AU nucleic acids), and certain natural 
substances (for example, wax D from Mycobacterium tuberculosis, substances found in 
Corynebacterium parvum, or Bordetella pertussis, and members of the genus Brucella). 
Among those substances particularly useful as adjuvants are the saponins (e.g., Qui! A., 
Superfos A/3, Denmark), Examples of materials suitable for use in vaccine 
compositions are disclosed, e.g., in Osol, A,, ed., Remington's Pharmaceutical 
Sciences, Mack Publishing Co., Easton, Pa. (1980), pp. 1324-1341, which reference is 
entirely incorporated herein by reference. 

A pharmaceutical vaccine composition of the present invention can further or 
additionally comprise at least one antiviral chemotherapeutic compound. Non-limiting 
examples can be selected from at least one of the group consisting of gamma globulin, 
amantadine, guanidine, hydroxy benzimidazole, interferon-. alpha., interferon-.beta., 
interferon-.gamma., interleukin-16 (IL-16; Kurth, Nature, Dec. 8, 1995); 
thiosemicarbarzones, methisazone, rifampin, ribvirin, apyrimidine analog (e.g., AZT 
and/or 3TC), a purine analog, foscamet, phosphonoacetic acid, acyclovir, 
dideoxynucleosides, a protease inhibitor (e.g., saquinavir (Hoffinann-La Roche); 
indinavir (Merck); ritonavir (Abbott Labs); AG 1343 (Agouron Pharmaceuticals); VX- 
2/78 (Glaxo Wellcome)); chemokines, such as RANTE3, MIP1. alpha, or MlPl.beta. 
(Science 270:1560-1561 (1995)) or ganciclovir. See, e.g., Richman: AIDs Res. Hum. 
Retroviruses 8: 1065-1071 (1992); Annu Rev Pharmacol Toxico 33: 149-164 (1993); 
Antimicrob Agents Chemother 37: 1207-1213 (1993); AJDs Res. Hum. Retroviruses 
10: 901 (1994): Katzung (1992), infra, and the references cited therein on pages 798- 
800 and 680-681, respectively, which references are herein entirely incorporated by 
reference. 
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Pharmaceutical Uses 

The administration of a vaccine (or the antisera which it elicits) can be for either a 
"prophylactic" or "therapeutic" purpose, and preferably for prophylactic purposes. 
When provided prophylactically, the nucleic acid vaccine composition is provided in 
advance of any detection or symptom of tumor associated pathology. The prophylactic 
administration of the compound(s) serves to prevent or attenuate any subsequent tumor 
associated pathology. 

When provided therapeutically, the nucleic acid or viral vaccine is provided upon the 
detection of a symptom of actual infection. The administration of a vaccine after 
detection of tumor-associated pathology is provided only where the patient's immune 
system is determined to be capable of responding to administration of a vaccine of the 
present invention. 

Alternatively, where the patient's immune response is compromised, therapeutic 
administration preferentially involves the use of an attenuated or inactivated viral 
vaccine composition where the viral vaccines are attenuated or inactivated, as presented 
above. See, e.g., Berkow (1987), infra, Goodman (1990), infra, Avery (1987), infra 
and Katzung (1992), infra, Dorozynski and Anderson, Science 252:501-502 (1991) 
which are entirely incorporated herein by reference, including all references cited 
therein. 

A composition is said to be "pharmacologically acceptable" if its administration can be 
tolerated by a recipient patient. Such an agent is said to be administered in a 
"therapeutically or prophylactically effective amount" if the amount administered is 
physiologically significant. A vaccine or composition of the present invention is 
physiologically significant if its presence results in a detectable change in the 
physiology of a recipient patient, preferably by enhancing a humoral or cellular 
immune response to a tumor. 

The "protection" provided need not be absolute, i.e., the tumor need not be totally 
prevented or eradicated, provided that there is a statistically significant, improvement 
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relative to a control population. Protection can be limited to mitigating the severity or 
rapidity of onset of symptoms of the disease. 

Pharmaceutical Administration 

A vaccine of the present invention can confer resistance to one or more types of a 
tumor. The present invention thus concerns and provides a means for preventing or 
attenuating infection by at least one tumor. As used herein, a vaccine is said to prevent 
or attenuate a disease if its administration to an individual results either in the total or 
partial attenuation (i.e. suppression) of a symptom or condition of the disease, or in the 
total or partial immunity of the individual to the disease. 

At least one nucleic acid vaccine of the present invention can be administered by any 
means that achieve the intended purpose, using a pharmaceutical composition as 
described herein. 

For example, administration of such a composition can be by various parenteral routes 
such as subcutaneous, intravenous, intradermal, intramuscular, intraperitoneal, 
intranasal, transdermal, or buccal routes. Subcutaneous administration is preferred. 
Parenteral administration can be by bolus injection or by gradual perfusion over time. 
See, e.g., Berkow (1987), infra, Goodman (1990), infra, Avery (1987), infra, and 
Katzung (1992), infra, which are entirely incorporated herein by reference, including all 
references cited therein. 

A typical regimen for preventing, suppressing, or treating a disease or condition which 
can be alleviated by a cellular immune response by active specific cellular 
immunotherapy, comprises administration of an effective amount of a vaccine 
composition as described is above, administered as a single treatment, or repeated as 
enhancing or booster dosages, over a period up to and including one week to about 24 
months. 

According to the present invention, an "effective amount" of a vaccine composition is 
one which is sufficient to achieve a desired biological effect, in this case at least one of 
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cellular or humoral immune response to at least one tumor. It is understood that the 
effective dosage will be dependent upon the age, sex, health, and weight of the 
recipient, kind of concurrent treatment, if any, frequency of treatment, and the nature of 
the effect desired. The ranges of effective doses provided below are not intended to 
limit the invention and represent preferred dose ranges. However, the most preferred 
dosage will be tailored to the individual subject, as is understood and determinable by 
one of skill in the art, without undue experimentation. See, e.g., Berkow (1987), infra, 
Goodman (1990), infra, Avery (1987), infra, Ebadi, Pharmacology, Little, Brown and 
Co., Boston, Mass. (1985), and Katsung (1992), infra, which references and references 
cited therein, are entirely incorporated herein by reference. Whatever dosage is used, it 
should be a safe and effective amount as determined by known methods, as also 
described herein. 

Subjects 

The recipients of the vaccines of the present invention can be any mammal which can 
acquire specific immunity via a cellular or humoral immune response to tumor, where 
the cellular response is mediated by an MHC class I or class II protein. Among 
mammals, the preferred recipients are mammals of the Orders Primata (including 
humans, chimpanzees, apes and monkeys). The most preferred recipients are humans. 

Having now generally described the invention, the same will be more readily 
understood through reference to the following examples, which are provided by way of 
illustration, and are not intended to be limiting of the present invention. 

Examples 

We believe it is preferable that cytotoxic immunity to MUC1 be generated 
through the expression of MUC1 by antigen presenting cells with the subsequent 
presentation of digested MUC1 peptides in the context of Class I molecules. Transgene 
has taken an approach along these lines, using a vaccinia virus encoding MUC1 and IL-2 
(29-3 1). This strategy would allow expression of MUC1 with natural processing of 
peptide for presentation to the immune system, with the function of IL-2 being to support 
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the growth of CTLs, In three of nine patients, cellular responses were detected, and the 
two patients with documented CTL activity survived the longest, although the results are 
not significant (3 1), One important limitation to this strategy is that repeated 
administration of a viral vector results in a strong immune response to the vector itself. 
This limits the number of times the drug can be administered, because the host immune 
response acts to clear the drug very quickly. Another approach that may make its way to 
the clinic, and appears effective in mice, is the fusion of MUC1+ tumor cells with 
dendritic cells, followed by vaccination of the mice with the fusion cells (32, 33)* This 
leads to specific MUC1 cellular immunity that is protective for tumor challenge and tumor 
treatment in mice. Because every patient is immunologically unique, this would require 
unique reagents for each patient. This approach may thus turn out to be very difficult to 
translate into mass usage because of its expense and requirement for sophisticated medical 
expertise. 

Our strategy is to use DNA vaccination to drive a cellular immune response 
against tumor cells expressing MUC1. We believe that this approach offers significant 
advantages over the other strategies listed above. First, DNA vaccines are known to 
generate strong humoral and cellular immune responses in numerous animal studies (34, 
35), and cellular responses in at least one human trial (36). Second, we believe that a 
cellular immune response, with the generation of CTLs will be the best way to eliminate 
MUCH- tumor cells. CTLs directed against a particular antigen recognize specific 
peptides presented in the context of Class I molecules on a cell surface. Recognition by 
CTL then results in destruction of the cell expressing that antigen, DNA vaccines can 
induce the generation of CTLs directed against the antigen encoded by the vaccine (34, 
35). If the antigen is a tumor antigen, tumor cells would be lysed by the CTLs, In 
contrast, anti-tumor antibodies are typically of low avidity and are not very effective in 
causing ADCC of tumor cells. Third, by injecting a plasmid that will encode the whole 
MUC1 protein, the patient's immune system can choose the best peptides for presentation 
according to his/her unique array of Class I molecules, rather than limiting the drug to one 
or several putative Class I peptides. Fourth, we have shown in preclinical studies that a 
combination of plasmids encoding MUC1 and the cytokine IL-18 protect mice from 
developing tumors, whereas plasmids encoding MUC1 or EL-18 alone offer little to no 
protection. IL-1 8 is a cytokine known to skew a nascent immune response toward a 
cellular response, rather than a humoral response (37). Fifth, DNA vaccination is a 
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flexible therapeutic strategy, in that one can design a DNA vaccine that encodes not just 
MUC1 but other molecules that could help to drive the immune response. Sixth, DNA 
vaccines are simple in concept and delivery to the patient, and should provide a cost- 
effective approach toward cancer treatment. Seventh, DNA vaccines can be administered 
indefinitely to the patient, because DNA is nontoxic, and because only the protein product 
of the DNA, not the DNA itself, is immunogenic* 

The invention is a plasmid that encodes human MUC1 and a plasmid that 
encodes human IL-18, or a multicistron plasmid that encodes both genes. The mode of 
delivery could also be MUC1 DNA and IL-18 DNA encoded by a viral vector, or RNA 
encoding each gene. The invention includes an IL-18 gene construct comprised of mature 
EL- 1 8 linked to a heterologous signal sequence, specifically an immunoglobulin signal 
sequence. This permits mature IL-18 to be expressed without the requirement for caspase 
cleavage of the IL-18 precursor protein. 

Coinjection of both MUC1 and EL- 18 plasmids intramuscularly at the same 
site is presumed to cause the local expression of both proteins in muscle cells, as well as 
the takeup and expression of both plasmids by professional antigen presenting cells 
(APCs) feat are migrating through the tissue. This leads to a memory immune response 
that is protective for animals subsequently challenged with MUC1 + tumor cells. It appears 
that the vaccination can break self-tolerance to MUC L 

The vaccination also leads to protection from subsequent challenge by 
MUC1" tumor cells that are otherwise identical to the MUC1 + tumor cells. This 
phenomenon is known as epitope spreading, and may be a critical, unique feature of the 
vaccine that enables the immune system to develop a response to MUC1 and to other 
undefined antigens expressed by the tumor Tumors are adept at evading the immune 
system, notably by changing their array of antigens on the cell surface (escape variants). 
Thus, a vaccine that induces immunity to more than one tumor antigen should make it 
more difficult for tumors to evade the immune system, and this could result in more 
effective cancer therapy. 

Our studies show that MUC1 and IL-18 plasmids synergize to induce the 
formation of a protective anti-tumor immune response. The first study was performed in 
C57B1/6 mice (43). Nine groups of animals were vaccinated with either vehicle control, 
empty vector, pMUCl, or pEL- 18, singly or in combination. Three vaccinations were 
performed over a three- week period, and the mice were challenged with syngeneic 
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MUC1 + tumor cells (38, 39) by subcutaneous injection in the fourth week. Animals were 
then monitored for tumor incidence and tumor volume for up to seven weeks thereafter. 
Results are shown in Figure 1 . None of the mice in the groups receiving vehicle, empty 
plasmid or pIL-18 were protected from developing tumors. Two groups received 
suboptimal doses of pMUCl, and only 2-3 mice were protected. Of the groups vaccinated 
with the various combinations of pMUCl and pIL-18 plasmids, those groups receiving the 
higher dose of pMUCl in combination with either dose of pBL-18 showed good protection 
(6/9 or 7/9 mice). These results are significantly different from the control results 
(p=0.011 orp=0.003). 

Tumor volume was also evaluated. The best result was seen in the group 
receiving 5ug pMUCl/5ug pIL-18, where tumor growth appeared to be delayed to day 35. 
At that time the slope of tumor growth parallels that of the other groups (Figure 2). 

Sera from the animals was collected pre-study, and at days 13, 26 and 34 
during and after vaccination. Sera were tested for the presence of anti-MUCl antibodies, 
but only low titers were seen. This result indicates that a strong anti-MUCl antibody 
response was not responsible for the protection seen in the animals. 

The surviving mice from the first phase of this study were then entered into 
a second phase, which was designed to learn if the mice had developed a protective anti- 
tumor immune response that could be recalled. The mice were subjected to a second 
challenge with MUC1 + tumor cells, with the results shown in Figure 3. Again, the group 
that originally received 5ug of each test plasmid fared well, with 4 of the original 9 mice 
protected for another 49 days, while in the group receiving 5ug pMUCl and 50ug pIL-18, 
3 of the original 9 mice were still protected. This result indicates that some of the 
rechallenged mice had developed a protective cellular immune response, because they 
were able to fend off a second challenge of tumor cells. 

The above study showed that while neither plasmid alone offered much 
protection from tumor challenge, and thus did not prime the immune response particularly 
well, vaccination with both plasmids at certain doses could indeed lead to protection from 
tumor challenge, or at least a delay in tumor development. We then sought to reproduce 
these results in a model system more reflective of the human patient, and we used a strain 
of C57B1/6 mice transgenic for human MUC1 (40-42; referred to as MUC1 Tg mice). 
This model would allow us to test if the combination of plasmids was effective, and if we 
could break tolerance to a self-antigen. We repeated the study shown above using the 
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transgenic mice and using increased doses of pMUCl, but testing the same doses of pIL- 
18. 

The results in the second study are consistent with the first (44; see Figure 
4). Animals receiving empty plasmid showed no protection from tumor challenge. Only 
one animal receiving the higher dose of pMUCl was protected, while none of those 
receiving pIL-18 alone were protected. In contrast, the groups receiving the combinations 
of pMUC 1/pDL- 1 8 showed notable protection, particularly the group receiving the highest 
dose of each plasmid (8/9 without tumors; p=0.002). 

On day 28 the tumors were excised and weighed, as shown in Figure 5. 
Neither the pMUC 1 nor pIL- i 8 groups had mean weights that were significantly different 
from the empty vector control group. However, all four pMUCl/pIH8 combination 
groups had mea tumor weights that were significantly smaller than those of the empty 
vector control group (p=0.004-Q.038). The results show that not only did the combination 
of pMUCl/pIL-18 have a positive effect on tumor incidence, it had a positive effect on 
tumor weights as well. Neither of these effects was observed with either plasmid alone. 

Mice from the combination groups were then rechallenged with MUC1 + 
tumor ceils to learn if they had developed protective immunity that could be recalled 
(Figure 6). Of the 5 mice that had originally been vaccinated with lOOug pMUCl/50ug 
pIL-18, 4/5 remained free of tumor growths in phase II after the second tumor challenge. 
Both of the mice from the group that was vaccinated with lOOug pMUCl/5ug pIL-18 also 
remained free of growths throughout the second challenge, while 1 of 2 mice each from 
the two remaining groups developed growths. The results support the hypothesis that the 
mice developed a memory response that was recalled in response to the second tumor 
challenge. 

We then determined if the mice had developed a broader immune response 
to antigens besides MUC1. The same animals in phase II were challenged again but with 
MUCr MC38 tumor cells. The MC38 cells are the parent line to the MUC1 + tumor cells, 
and are otherwise expected to be identical (38). Results of the third challenge are shown 
in Figure 7. Interestingly, the mice that were originally vaccinated with the lOOug dose of 
pMUCl in combination with either dose of pIL-18 continue to be protected, while the 
three naive control MUC1 Tg mice succumbed to tumors. This result suggests that the 
vaccinated mice have developed immunity to determinants shared between the two cell 
lines, in addition to immunity to MUC1 . This phenomenon is known as epitope 
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spreading, and is well documented in autoimmune disease models in animals (46, 47), In 
these models, animals are first immunized with a self-protein or peptide against which 
they develop immunity, and the immune response causes the destruction of normal tissue 
expressing the native protein. After tissue destruction the immune response broadens to 
include antigens that the animals were not immunized against but which are expressed by 
the target tissue. If such a process could be duplicated in humans, DNA vaccination could 
be very effective at inducing immunity to MUC1 as well as other unique determinants 
present on tumor cells, and broadening the immune response should only be helpful to 
patient therapy. In addition, tumor cells are continuously changing in response to 
environmental pressures, and therapy against one antigen could lead to remission until 
escape variants arise that no longer express that antigen. With epitope spreading, the 
immune response broadens to include other antigens and theoretically should improve the 
chances that the tumor cells will be unable to escape the vigilance of the immune system. 

A second advantage of this approach includes the use of a human IL-18 
construct that encodes the mature form of IL-1 8 linked to an immunoglobulin signal 
sequence. IL-18 is ordinarily expressed as a precursor protein that is not functional until it 
is cleaved into its mature form by caspase (48, 49). Most cells do not express caspase, 
therefore one strategy to ensure IL-18 expression in any cell type is to engineer the protein 
so that it does not require caspase cleavage for maturation. We have used a genomic 
fragment that encodes the anti-IL-12 12B75 heavy chain signal sequence (50) linked to a 
human IL-18 cDNA sequence to ensure production of human IL-18 in any cell type* This 
strategy was effective for both the human and mouse IL-18 genes. 

A third advantage of our approach is to use a MUC1 cDNA that includes 
one of its own introns to improve expression from the plasmid (Figure 9. 

A fourth advantage of our approach is the ability to encode more than one 
gene on a plasmid to enable delivery of more than one protein product to a target 
tissue/cell (51, 52). This should ensure that a target tissue expresses all desired proteins 
with the expectation of a more efficient induction of immune response, A double cistron 
vector has been constructed, and we have shown that it is capable of expressing mouse or 
human IL-1 2. IL-1 2 is a protein comprised of two subunits that must be co-expressed in 
the same cell in order for the mature molecule to be produced. The two protein subunits 
are encoded by different genes, and we have shown in tissue culture that a double cistron 
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vector encoding both genes results in more effective production of the mature protein than 
using two plasmids which encode either gene alone (51 ? 52). 
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We wished to explore the epitope spreading phenomenon further, specifically to 
learn if DNA vaccination followed by just a single tumor challenge with MUC1+ cells 
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would give rise to epitope spreading. Animals were vaccinated according to the groups 
shown in Figure 10. Vaccination with pMUCl/pIL-18 is the only regimen that results 
in significant protection (8/18 mice) compared to the empty vector group (p=0.007). 
Tumor weights are likewise significantly smaller in this group versus the other three 
groups (Figure 11), These results confirm the previous data demonstrating that the 
combination of pMUCl and pIL-18 offer better protection against tumor challenge, and 
also cause a significant reduction in tumor weight in those animals that still develop 
tumors. Further, the data indicate that the combination of the two plasmids allows one 
to break tolerance to the MUC1 self antigen in fee MUC1 transgenic mice. 

The 8 protected mice from the pMUCl/pIL-1 8 group, and the 3 protected mice 
from the pMUCl-only group were challenged with MUC1' tumor cells (Figure 12). 
Only 1/15 control naive animals survived tumor challenge, whereas 4/8 and 2/3 
vaccinated animals remained tumor free. This result indicates that epitope spreading 
occurs with the immune response generated by the DNA vaccination and the first tumor 
challenge. Further, the fact that epitope spreading occurs in the pMUCl-only group 
suggests that lL-18 may not be required for this phenomenon to occur. 
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Figure 10. Tumor incidence in female MUC1 transgenic mice vaccinated with DNA as 
indicated in the legend, and subsequently challenged with MUC1* tumor cells. Only 
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the group vaccinated with pMUCl/pIL-18 shows significantly improved protection 
from tumor challenge (p=MX007). 
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Figure 1 1 . Media tumor weights at study end, from animals shown in Figure 1 , Media 
tumor weight for group 4 is significantly different from those in the other groups. 
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pMUC1 100ug 
pMUC1/plL-18100/50ug 



Naive 



0 5 10 15 20 25 30 

day after seeding 

Figure 12. Rechallenge of protected mice from Figure 1 with MUC1" tumor cells. 

Experimental conditions for above: Female MUC1 transgenic mice were vaccinated in 
Figure 12 with the indicated quantities of plasmids, on day 0, 14, and 21. Mice were 
challenged with 1 .5x1 0 5 MIS A cells on day 28. They were monitored for tumor 
incidence, and tumor weights were measured at study end (Figure 11). The surviving 
mice from Figure 1 1 were challenged with 3xl0 5 MC38 cells 45-47 days after the 
initial tumor challenge (Figure 12). 

Tumor protection studies in male MUC1 transgenic mice 

We have tested whether vaccination of male MUC1 transgenic mice with pMUCl 
plasmid can induce a protective immune response upon challenge with MISA cells, 
Male mice were vaccinated on day 0, 14 and 21 with various doses of DNA, then 
challenged on Day 28 with 1.5xl0 5 MISA tumor cells (Figure 13). In the control 
group, nearly all mice (9/10) succumbed to tumors. Male mice vaccinated with 150ug 
of pMUCl showed good protection (6/10; p=0.019), and mice vaccinated with lOOug 
pMUCl showed protection in 3/9 mice (not significant). Lower doses of pMUCl did 
not result in any tumor protection. It appears that the pMUCl plasmid alone can offer 
significant benefit in reducing tumor incidence, at high dose. 
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Tumor weights are shown in Figure 14. Again, the tumor weights in the highest 
dose group show a significant difference from the control group (p^0.015). This result 
suggests that the vaccination also helps to control growth of the tumor cells in the mice 
that still develop tumors. 

To learn if the anti-tumor response was long-lived, the male mice that did not 
develop tumors (Figure 13) were rechallenged with L5xl0 5 MIS A cells on day 39 after 
the first tumor challenge. As shown in Figure 15, 3/6 and 1/3 of the pMUCl 
vaccinated mice remained protected after the rechallenge, suggesting that some animals 
did develop a long-lived recall response to the tumors. 
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empty vector 150ug 
pMUCl 150ug 
pNIUCI 100ug 
pMUCl 30ug 
pMUCl 10ug 
pMUCl 3ug 



Day after seeding 

Figure 13. Tumor incidence in male mice vaccinated with pMUCl or empty vector, 
followed by tumor challenge. 
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Figure 14. Tumor weights in male mice vaccinated with pMUCl. 
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Days after rechalienge with MISA cells 



Figure 15* Tumor incidence in male mice rechallenged on the opposite flank with 
MUC1+ tumor cells. 



The present invention is not to be limited in scope by the specific embodiments described 
herein. Indeed, various modifications of the invention in addition to those described 
herein will become apparent to those skilled in the art from the foregoing description and 
the accompanying figures. Such modifications are intended to fall within the scope of the 
appended claims. 
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It is further to be understood that all base sizes or amino acid sizes, and all molecular 
weight or molecular mass values, given for nucleic acids or polypeptides are approximate, 
and are provided for description. 

Various publications are cited herein, the disclosures of which are incorporated by 
reference in their entireties. 
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WHAT IS CLAIMED IS: 

1. A nucleic acid vaccine, comprising 

(a) at least one polynucleotide encoding at least one antigenic portion of at 
least one amino acid sequence comprising or encoded by at least one of 
SEQ ID NOS:l-47 or variants thereof, or a nucleic sequence 
complementary thereto; and 

(b) at least one polynucleotide encoding at least one adjuvant encoding 
portion of at least one amino acid sequence comprising or encoded by at 
least one of SEQ ID NOS:60-77 or variants thereof, or a sequence 
complementary therero* 

2. A nucleic acid vaccine according to claim 1, wherein said antigen is selected from at 
least one of MUC-1, PSA, or KXK2. 

3. A nucleic acid vaccine according to claim 2, wherein said MUC-1 amino acid 
sequence is selected from at least one antigenic portion of at least one of SEQ ID 
NOS:20, 22, 26, 28, 30, 32, 34, 35, 37, 39, 41, 43, and 47. 

4. A nucleic acid vaccine according to claim 2, wherein said PSA amino acid sequence is 
selected from at least one antigenic portion of at ieast one of SEQ ID NOS:l, 4-10, 12 
and 14-15. 

5. A nucleic acid vaccine according to claim 2, wherein said IL-18 amino acid sequence 
is selected from at least one antigenic portion of at least one of SEQ ID NOS:64, 65, 
69, 70-71, 74-75 and 77. 

6. A nucleic acid vaccine according to claim 1 , wherein the vaccine further comprises at 
least one promoter sequence controlling the expression of said antigen encoding 
polynucleotide. 

7. A nucleic acid vaccine according to claim 2, wherein the promoter is at least one 
cytomegalovirus immediate early (CMV) promoter. 
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8. A nucleic acid vaccine according to claim 2 S wherein the promoter is at least one 
dihydrofoliate reductase (dhfr) promoter. 

9. A nucleic acid vaccine according to claim 2, where the promoter is at least one early 
or late SV40 promoter. 

10. A nucleic acid vaccine according to claim 1, comprised of a nucleic acid vector. 
1L A nucleic acid vaccine according to claim 1, comprised of a host cell 

12. A nucleic acid vaccine according to claim 1, comprised of viral vector. 

13. A composition comprising a nucleic acid vaccine according to claim 1. 

14. A tumor/adjuvant vaccine composition comprising a nucleic acid vaccine according to 
claim 1 and a pharmaceutically acceptable carrier or diluent, 

15. A nucleic acid vaccine composition of claim 11, further comprising an additional adjuvant 
and/or cytokine encoding sequence or component of the composition which enhances a 
nucleic acid vaccine immune response to at least one cancer associated tumor protein in a 
mammal administered the vaccine composition. 

16. A method for eliciting an immune response to a cancer associated tumor protein in a 
mammal that is prophylactic for a cancer associated tumor protein, comprising 
administering to a mammal a nucleic acid vaccine according to claim 1 . 

17. A method for eliciting an immune response to a cancer associated tumor protein in a 
mammal for therapy of a tumor-associated pathology, comprising administering to a 
mammal a nucleic acid vaccine according to claim 1 . 

18. A method according to claim 13, further comprising priming or boosting a humoral or 
cellular immune response, or both, by administering an effective amount of at least one of 
said nucleic acid vaccine. 

r 

19. A method according to claim 14, further comprising priming or boosting a humoral or 
cellular immune response, or both, by administering an effective amount of at least one of 
said nucleic acid vaccine. 
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properties one from the others. Similarly, SEQ ID NOs: 60-77 lack the same or corresponding special technical feature because each 
SEQ ID NO. is different one from the others depending upon whether the IL-18 sequences contain signal sequences, introns, variants 

having different IL-18 activity and derived from different sources (e.g., human rhesus macaque"). 
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